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Preface to the new edition 


The book “Theory of Codes” is now out of print. An electronic version is available on 
Berstel’s or Perrin’s homepage. 

This text is the present (January 22, 2008) version of a new book in preparation, 
coauthored with Christophe Reutenauer entitled “Codes and Automata”. The process 
of rewriting is ongoing. Comments of all kinds are wellcome. The versions will change 
for some of them quite rapidly. 

There is an important reorganization of the material. Major changes are the follow- 
ing. 


Preliminaries This chapter contains now an introduction to automata with multi- 
plicities, and a short presentation of the Perron—Frobenius theorem. 


Codes There is a detailed presentation of the algorithms for testing codicity. Unam- 
biguous automata have been moved from Chapter IV to this chapter. 


Prefix codes A new section on length distributions has been added. 


Deciphering delay This new chapter contains the section on deciphering delay of 
Chapter IT of the previous edition, and two entirely new sections. 


Bifix codes New material has been added on completion, and on length distribution. 


Circular codes This is based on the old chapter entitled “Conjugacy”. It contains a 
new presentation of the length distribution of circular codes. The sections concerning 
factorizations of free monoids now form a new chapter. 


Automata The old chapter has been cut in two pieces. The sections concerning 
groups of codes and synchronization of semaphores have been moved to other places. 


Synchronization This new chapter contains new results on locally parsable codes 
(called previouly codes with finite synchronization delay) and local automata. 


Groups of codes It is composed of the chapter “Group of bifix codes” and of material 
moved here from former Chapter IV. 


vil 


viii CONTENTS 


Factorizations of cyclic groups This is a new chapter. 
Densities This will be updated soon. 


Factorizations of free monoids This is a new chapter composed of sections previ- 
ously in the Chapter on conjugacy. 


Polynomial of a code This chapter has been entirely remodeled. 


Solutions This contains solutions to all old and new exercises. There are a lot of 
new exercises. 
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Chapter 0 


Preliminaries 


0 Introduction 


In this preliminary chapter, we give a short but complete account of some basic notions 
which will be used throughout the book. This chapter is not designed for a systematic 
reading but rather as a reference. 

The first three sections contain our notation and basic vocabulary. Each of the 
subsequent sections is an introduction to a topic which is not completely treated in 
this book. These sections are concerned mainly with the theory of automata. Kleene’s 
theorem is given and we show how to construct a minimal automaton from a given 
automaton. Syntactic monoids are defined. These concepts and results will be discussed 
in another context in Chapter pe 


1 Notation 


As usual, N, Z, Q, R, and C denote the sets of nonnegative integers, integers, and ratio- 
nal, real, and complex numbers, respectively. By convention, 0 € N. We set 


R,={cER|x>0}. 


Next, 


denotes the binomial coefficient of n and p. 
Given two subsets X,Y of a set Z, we define 


X\Y={zEeZ|zex,z¢Y}. 


An element x and the singleton set {x} will usually not be distinguished. The set of 
all subsets of a set X is denoted by B(X). 

The function symbols are usually written on the left of their arguments but with 
some exceptions: When we consider the composition of actions on a set, the action is 
written on the right. In particular, permutations are written on the right. 

A partition of a set X is a family (X;)ier of nonempty subsets of X such that 


1 


2 0. Preliminaries 


(i) X = UicerXi 

(i) Xin. Xj} =O, 6 £3). 
We usually define a partition as follows: “Let X = Ujc;X; be a partition of X”. We 
denote the cardinality of a set X by Card(X). 


2 Monoids 


A monoid is a set M equipped with an associative binary operation and has a neutral 
element. The operation is usually written multiplicatively. The neutral element is 
unique and is denoted by 1 or simply by 1. 
For any monoid M, the set P(M) is given a monoid structure by defining, for X,Y C 
M, 
XY ={ay|xexX,yeY}. 


The neutral element is {1}. 
A submonoid of M is a subset N which is stable under the operation and which 
contains the neutral element of M 


NNCN, (0.2.1) 
IMEN. 


wt 
Note that a subset N of M satisfying (Goa does not always satisfy 1jy = 1y and 
therefore may be a monoid without being a submonoid of M. 
A morphism from a monoid M into a monoid N is a function 


y:MAaN 


which satisfies, for all m, m’ € M, 


and furthermore 


A congruence on a monoid M is an equivalence relation 9 on M such that, for all 
mm € M, u,v E€ M 


mz=m  mod@ => umv = um'v mod 8. 


Let y be a morphism from M onto N. The equivalence 0 defined by m = m’ mod 9 if 
and only if (m) = v(m’) is a congruence. It is called the nuclear congruence induced 
by y. Conversely, if 9 is a congruence on the monoid M, the set M/0 of the equivalence 
classes of 0 is equipped with a monoid structure, and the canonical function from M 
onto M/6@ is a monoid morphism. 

An idempotent of a monoid M is an element e of M such that 


e =e. 
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2. Monoids 3 


For each idempotent e of a monoid M, the set eMe is a monoid contained in M. It is 
easily seen that it is the largest monoid contained in M having e as a neutral element. 
It is called the monoid localized at e. 

An element 0 of a monoid M is a zero if 0 £1 and for all m € M 


Om = m0 =0. 


If M contains a zero it is unique. 

It will sometimes be convenient to use the term semigroup to denote a set equipped 
with an associative operation but without the requirement of the existence of a neutral 
element. The notions of subsemigroup and semigroup morphism are then defined in 
the same way as the corresponding notions for monoids. 

Let M be a monoid. The set of (left and right) invertible elements of M is a group 
called the group of units of M. 

A cyclic monoid is a monoid with just one generator, i.e., 


M ={a"|neN} 


with a? = 1. If M is infinite, it is isomorphic to the additive monoid N of nonnegative 
integers. If M is finite, the index of M is the smallest integer i > 0 such that there 
exists an integer r > 1 with 

7 = dt. (0.2.2) 


. 3 ; ' . 
The smallest integer r such that Ei holds is called the period of M. The pair 
composed of index 7 and period p determines a monoid having i + p elements, 


= 2 i-1 „i itp-1 
Mip = {1,a,a°,...,07,a",...,a 2 


1 
Its multiplication is conveniently represented in Figure pat 


Figure 0.1: The monoid M; p. 


The monoid M;,, contains two idempotents (provided i > 1). Indeed, assume that 
a) = ai. Then either j = 0 or j > i and j and 2j have the same residue mod p, hence 
j =0 mod p. Conversely, if j > i and j = 0 mod p, then af = a”. 

Consequently, the unique idempotent e 4 1 in Mj, is e = a’, where j is the unique 
integer in {i,i + 1,...,i+p—1} which is a multiple of p. 
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Let M be a monoid. For z,y E€ M, we define 
a ty={zeM|az=y} and ryt = {z € M |£ = zy}. 
For subsets X,Y of M, this notation is extended to 
XY = Uzex Uyey aly and XY! =Usex Uyey zy. 


The set X~'Y is called a left residual of Y. The following identities hold for subsets 
X,Y,Z of M: 


(XY) İZ =Y 4x 2) and XHY ZH) = (XY). 


The notation X~'Y should not be confused with the product of the inverse of an 
element with another in gome group. There is only one case where the confusion could 
arise, in Chapter g Í jA where a due “caveat” will be found. 

Given a subset X of a monoid M, we define 


F(X) = MXM! 


to be the set of factors of elements in X. We have 


F(X)={m€ M | Ju, v € M:umve X}. 
We sometimes use the notation F(X) to denote the complement of F(X) in M, 
F(X) =M\ F(X). 


An important example is the monoid of relations over a nonempty set Q. A relation 
m over a set Q is a subset of Q x Q. The product of two relations m and n over Q is 
the relation mn defined by 


(p, r) Emn = Jq€Q: (p,¢) € mand (q,r) En. 


The set PH(Q x Q) of relations over a set Q is a monoid for this product. Two remarkable 
relations are the identity relation idg and the null relation, which is the empty subset 
of Q x Q. The identity relation idg is the neutral element of P(Q x Q). The null 
relation is a zero of this monoid. 

A monoid M of relations over Q is said to be transitive if for all p,q € Q, there exists 
m E M such that (p,q) E m. 


3 Words 


Let A be a set, which we call an alphabet. A word w on the alphabet A is a finite 
sequence of elements of A 


w = (a1, a2,...,an) GEA. 


The set of all words on the alphabet A is denoted by A* and is equipped with the 
associative operation defined by the concatenation of two sequences 


(a1, a2,... , an (01509; 2225 Um) = (a1, a2,... , an, 1 b2,...,b0m). 
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3. Words 5 


This operation is associative. This allows us to write 
W = A102: an 


instead of w = (a1,a2,...,@n), by identifying each element a € A with the sequence 
(a). An element a € A is called a letter. The empty sequence is called the empty word 
and is denoted by 1 or £. It is the neutral element for concatenation. Thus the set 
A* of words is equipped with the structure of a monoid. The monoid A* is called the 
free monoid on A. The set of nonempty words on A is denoted by At. Thus we have 
At = At \1 

The length |w| of the word w = aiaz ...an with a; € A is the number n of letters in 
w. Clearly, |1| = 0. The function w + |w| is a morphism from A* onto the additive 
monoid N. For n > 0, we use the notation 


A™ = {w e€ A* | |w] <n—-1} 


and also 
AP = {w € A* | jw] < n}. 
In particular, A® = and A = {1}. 
For a subset B of A, we let |w|g denote the number of letters of w which are in B. 


Thus 
jw] = S lwla: 


acA 
For a word w € A*, the set 


alph(w) = {a € A | |wla > 0} 
is the set of all letters occurring at least once in w. For a subset X of A*, we set 
alph(X) = Uzex alph(z). 


A word w € A* is a factor of a word x € A* if there exist u,v E€ A* such that z = uwv. 
The relation is a factor of is a partial order on A*. A factor w of x is proper if w £ x. 
A word w € A* is a prefix of a word x € A* if there is a word u € A* such that 

x = wu. The factor w is called proper if w # x. The relation is a prefix of is again a 

partial order on A* called the prefix order. We write w < x when w is a prefix of x and 

w < x whenever w < x and w Æ x. This order has the following fundamental property. 

If for some z, 

w< a, w IT, 


then w and w’ are comparable, i.e., 
1 
w <w or w <w. 


In other words, if vw = v'w', then either there exists s € A* such that v = v's (and 
also sw = w’) or there exists t € A* such that v’ = vt (and then w = tw’). 

In an entirely symmetric manner, we define a suffix w of a word x by x = vw for 
some v € A*. A set P C A* is called prefiz-closed if it contains the prefixes of its 
elements: uv E€ P > u € P. A suffix-closed set is defined symmetrically. 
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Consider a totally ordered alphabet A. The lexicographic order on A* is defined by 
setting u < v if u is a proper prefix of v, or if u = ras, v = rbt, a < b for a,b € A and 
r,s,t E€ A*. The lexicographic order has the property 


U<US WU ~< WU. 


Similarly, the radix order on A* is defined by setting u < v if |u| < |u| or if |u| = |v] 
and u ~ v in the lexicographic order. 
The reverse w of a word w = a1a2'-: an, a; E A, is 


Ù = An°+:a2Q1. 
The notations w and w are equivalent. Note that for all u,v € A*, 
(uv) = vu. 


The reverse X of a set X C A* is the set X = {%| ax € X}. 
A factorization of a word w € A* is a sequence {u,U2,-.-,Un} of n > 0 words in A* 
such that 
wW = UIU? ... Un. 


For a subset X of A*, we denote by X* the submonoid generated by X, 
X* = {£1£2 -£n | Nn > 0, x; E X}. 
Similarly, we denote by X* the subsemigroup generated by X, 


Xt = {x1%9-++tn,|n > 1,2; € X}. 


We have 
ge [rt BEX, 
X* otherwise. 
By definition, each word w in X* admits at least one factorization (£1, £2,..., £n) 


whose elements are all in X. Such a factorization is called an X-factorization, We 
frequently use the pictorial representation of an X-factorization given in Figure ? 


X4 X2 Xn 


w 


Figure 0.2: An X-factorization of w 


A word x € A* is called primitive if it is not a power of another word. Thus x is 
primitive if and only if x = y” with n > 0 implies x = y. Observe that the empty word 
is not primitive. 

Two words x,y are called conjugate if there exists words u, v such that z = wv, y = vu. 
(See Figure We frequently say that y is a conjugate of x. Two conjugate words 
are obtained from each other by a cyclic permutation. More precisely, let y be the 
function from A* into itself defined by 


y(1) =1 and y(av) = va (0.3.1) 
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fora € A, v € A*. It is clearly a bijection from A* onto itself. Two words x and y are 
conjugate if and only if there exists an integer n > 0 such that 


r=7"(y). 


This easily implies that the conjugacy relation is an equivalence relation. A conjugacy 
class is a class of this equivalence relation. 


u oo 
y 


Figure 0.3: Two conjugate words 


PROPOSITION 3.1 Each nonempty word is a power of a unique primitive word. 


Proof. Let x € A* and 6 be the restriction of the function y defined by (bt to the 
conjugacy class of x. Then ô% = 1 if and only if x is a power of some word of length 
dividing k. 

Let p be the order of ô, i.e., the g.c.d. of the integers k such that 6° = 1. Since 6? = 1, 
there exists a word r of length p such that x = r° with e > 1. The word r is primitive, 
otherwise there would be a word s of length q dividing p such that r € s*, which in 
turn implies that x € s*, contrary to the definition of p. This proves the existence of 
the primitive word. To show uniqueness, consider a word t € A* such that x € t* and 
let k = |t|. Since 6* = 1, the integer k is a multiple of p. Consequently t € r*. Thus if 
t is primitive, we have t =r. n 


Let x € At. The unique primitive word r such that x = r” for some integer n is 
called the root of x. The integer n is the exponent of x. 


PROPOSITION 3.2 Two nonempty conjugate words have the same exponent and their 
roots are conjugate. 


Proof. Let x,y € At be two conjugate words, and let i be an integer such that y = y*(z). 
Set r and s be the roots of x and y respectively and let n be the exponent of x. Then 


This shows that y'(r) € s*. Interchanging the roles of x and y, we have yÍ (s) € rx. It 
follows that y‘(r) = s and 7/(s) = r. Thus r and s are conjugate and consequently x 
and y have the same exponent. n 


PROPOSITION 3.3 All words in a conjugacy class have the same exponent. If C is a 
conjugacy class of words of length n with exponent e, then 


Card(C) = n/e. 
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Proof. Let x € A” and C be its conjugacy class. Let ô be the restriction of y to C and 
p be the order of 6. The root of x is the word r of length p such that x = r°. Thus 
n = pe. Now C = {x,6(x)...,5?-1(x)}. These elements are distinct since p is the 


order of 6. Thus Card(C) = p. m 
n 1 2 3 4 5 6 7 8 9 10 ll 12 
la(2) 2 1 2 3 6 9 18 30 56 99 186 335 
la(3) 3 3 8 18 48 116 
la(4) 4 6 20 


Table 0.1: The number @,,(k) of primitive conjugacy classes over a k-letter alphabet. 


We now compute the number of conjugacy classes of words of given length over a 
finite alphabet. Let A be an alphabet with k letters. For all n > 1, the number of 
conjugacy classes of primitive words in A* of length n is denoted by @,,(k). The notation 
is justified by the fact that this number depends only on k and not on A. a 

The first values of this function, for k = 2,3,4, are given in Table BA Clearly 
fn(1) = 1 if n = 1, and @,(1) = 0 otherwise. Now for n > 1 


=Y déa(k), (0.3.2) 


d|n 


where d runs over the divisors of n. Indeed, every word of length n belongs to exactly 
one conjugacy class of words of length n. Each class has d = n/e elements, where e is 
the exponent of its words. Since there are as many classes whose words have exponent 
n/e as there are classes of primitive words of length d = n/e, the result follows. n 


We can obtain an explicit expression for the numbers @,,(k) by using the classical 
technique of Mobius inversion which we now recall. 
The Mobius function is the function 


w:N\ {0} >N 
defined by u(1) = 1 and 


‘ie if n is the product of i distinct prime numbers, 


0 otherwise. 


PROPOSITION 3.4 (Möbius Inversion Formula) Let a, be two functions from N \ 0 
into N. Then 


a(n)=S° Bd) (n> 1) (0.3.3) 


d\n 
if and only if 
Bln) = SD ulda(n/a) (n21): (0.3.4) 


d|n 
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3. Words 9 


Proof. Define a product on the set S of functions from N \ 0 into N by setting, for 
fgeEs 
feg(n)= X f@gle)- 


n=de 
It is easily verified that S is a commutative monoid for this product. Its neutral element 
is the function J taking the value 1 for n = 1 and 0 elsewhere. 
Let 1 € S be the constant function with value 1. Let us verify that 


be p=T (0.3.5) 


Indeed ¢ * u(1) = 1; for n > 2, let n = pr py? ---pkm by the prime decomposition of n. 
If d divides n, then (d) Æ 0 if and only if 


Li £ Lm 
d= pi p? ... Pra 


with all 4; = 0 or 1. Then u(d) = (—1)! with t = 7", 4. It follows that 


ula) = So wd) = Sen) =0. 


23 4 
is equivalent to a = x 8 and Formula (say 
hese two formulas are equivalent. n 


PROPOSITION 3.5 The number of conjugacy classes of primitive words of length n over 
an alphabet with k letters is 


talk) = = So uln/dk. 
d\n 


.2 
Proof This is immediate from Formula (Sah by Mobius inversion. n 


A word w € A® is called unbordered if no proper nonempty prefix of w is a suffix of 
w. In other words, w is unbordered if and only if 


weuAtnAtu implies u=1. 


u a b"! 
Figure 0.4: The word uabl™! 0_04 
If w is unbordered, then 
wA* N A*w=wA*wuw. 
The following property holds. 


PROPOSITION 3.6 Let A be an alphabet with at least two letters. For each word u € AF, 
there exists v E€ A* such that uv is unbordered. 
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Proof. Let a be the first letter o and let b € A\ a. Let us verify that the word 
w = uabl“| is unbordered (Figure pete A nonempty prefix t of w starts with the letter 
a. It cannot be a suffix of w unless |t| > |u|. But then we have t = sab!“! for some 
s € A*, and also t = uabl*!. Thus |s| = |u|, hence t = w. n 


Let A be an alphabet. The free group A© on A is defined as follows: Let A be an 
alphabet in bijection with A and disjoint from A. Denote by 


ara 

the bijection from A onto A. This notation is extended by setting, for all a € AU A, 
a=a. 

Let ô be the symmetric relation defined for u,v € (AU A)* and a € AU A by 


uaav = uv mod ô. 


Let p be the reflexive and transitive closure of ô. Then p is a congruence. The quotient 
A® = (AU A)*/p is a group. Indeed, for all a € AU A, 


aa = 1 mod p. 
Thus the images of the generators are invertible in A®. The group A® is called the free 
group on A. 
4 Automata 


Let A be an alphabet. An automaton over A is composed of a set Q (the set of states), 
a subset I of Q (the initial states), a subset T of Q (the terminal or final states), and 
a set 


FCQxAxQ 
called the set of edges. The automaton is denoted by 


A=(O. 17); 
The automaton is finite when the set Q is finite. 
A path in the automaton A is a sequence c = (f1, fo,.-., fn) of consecutive edges 
fi = (Gi, ai, G41); 1<i<n. 


The integer n is called the length of the path c. The word w = aias- -an is the label 
of the path c. The state qı is the origin of c, and the state qn+1 the end of c. A useful 
notation is 

c: qi — Qn+1- 


By convention, there is, for each state q € Q, a path of length 0 from q to q. Its label 
is the empty word. 
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A path c: i — t is successful if i € I and t € T. The set recognized by A, denoted by 
L(A), is defined as the set of labels of successful paths. 

A state q € Q is accessible (resp. coaccessible) if there exists a path c : i — q with 
i € I (resp. a path c : q >t with t € T). An automaton is trim if each state is both 
accessible and coaccessible. Let P be the set of accessible and coaccessible states, and 
let A? = (P, I N P,T N P). Then it is easy to see that A? is trim and 


L(A) = L(A’). 


The preceding terminology is of course reminiscent of graph theory. In this frame- 
work, an automaton is just a labelled multigraph equipped with two distinguished 
subset of vertices, the initial and the terminal states. The multigraph having Q as set 
of verticies, and F as set of edges, is called the underlying graph of the automaton. An 
automaton is called strongly connected if its underlying graph is strongly connected, 
that is if for any pair (p,q) of states (vertices), there is a path from p to q. 

Let A = (Q,I,T) be an automaton over A. For each word w, we denote by y4(w) 
the relation over Q defined by 


(p,q) € paw) —> p >q 


It follows from the definition that p4 is a morphism from A* into the monoid of relations 
over Q. The submonoid y4(A*) is called the transition monoid of the automaton A. 
Clearly, an automaton is strongly connected if and only if its transtion monoid is 
transitive. 
An automaton A = (Q,1,T) is deterministic if Card(I) = 1 and if 


(p,a,q), (p,a,r) EF >q=r. 


Thus for each p € Q and a € A, there is at most one state q in Q such that p  q. For 
p EQ, and a E€ A, define 


q if (p,a,q) EF, 
p:a= 
Ø otherwise. 


The partial function from Q x A into Q defined in this way is extended to words by 
setting p- 1 = p for all p € Q, and, for w € A* and a € A, 


p: wa = (p-w)-a. 


This function is called the transition function or next-state function of A. With this 
notation, we have with I = {i}, 


L(A) ={we A*|i-weT}. 


An automaton is complete if for all p € Q, a € A, there exists at least one q € Q such 
that p = q. 


PROPOSITION 4.1 For each automaton A, there exists a complete deterministic au- 
tomaton B such that 


L(A) = L(B). 
If A is finite, then B can be chosen to be finite. 
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Proof. Set A = (Q,1,T). Define B = (R,u,V) by setting R = B(Q), u=1, 
V={ScqQ|sSnTF}. 


Define the transition function of B, for S € R, a € A by 


S-a={q4EQ|IsES:s =q}. 


The automaton B is complete and deterministic. It is easily seen that L(A) = L(B). 


Let A = (Q, i, T) be a deterministic automaton. For each q € Q, let 
[g={weA*|q-weT}. 


Two states p,q € Q are called inseparable if Lp = Lg, and separable otherwise. A 
deterministic automaton is reduced if two distinct states are always separable. 

Let X be a subset of A*. We define a special automaton A(X) = (Q,i,T) in the 
following way. The states of A(X) are the nonempty sets u~!X for u € A*. The initial 
state is X = 171X, and the final states are those containing the empty word. The 
transition function is defined for a state Y = u~!X and a letter a € A by 


Y-a=alyY. 
Observe that this indeed defines a partial function. We have 
L(A(X)) =X. 
Indeed, an easy induction shows that X -w—=w7!X for w € A*. Consequently 
we L(A(X))S1lexX-welewlXewex. 
The automaton A(X) is reduced. Indeed, for Y = u7!X, 
Ly = {v € A |Y - veT}={veEA*|wexX}. 


Thus Ly = Y. 
The automaton A(X) is called the minimal automaton of X. This terminology is 
justified by the following proposition. 


PROPOSITION 4.2 Let A = (Q,i,T) be a trim deterministic automaton and let X = 
L(A). Let A(X) = (P, j, S) be the minimal automaton of X. The function yp from Q 
into P defined by p(q) = Lg is surjective and satisfies p(q-a) = p(q)-a. 


Proof. Let q E Q. Let u € A* such that i-u = q. Then 
Lg={weA*|q-weTh=u'X. 


Since A is trim, Lz Æ Ø. This shows that Ly € P. Thus ¢ is a function from Q into P. 
Next, let us show that ọ is surjective. Let u~!X € P. Then u-!X #0. Thusi-u #0 
and setting q = i- u, we have Ly = u-!X = y(q). Thus ¢ is surjective. 
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Finally, for q = i - u, one has y(q a) = Lga = (ua) 1X = (uw! X) -a = Lq a. m 


Assume furthermore that the automaton A in the proposition is reduced. Then the 
function y is a bijection, which identifies A with the minimal automaton. In this sense, 
there exists just one reduced automaton recognizing a given set. 

Let A = (Q,i,T) be a deterministic automaton. Consider the set F of partial 
functions from Q into Q. These functions are written on the right: if q € Q and 
m E F, then the image of q by m is denoted by gm. Composition is defined by 


q(mn) = (qm)n. 


Thus F has a monoid structure. 


Let y be the function which to a word w € A* associates the partial function from 
Q into Q defined by 


qp(w) =q-w. 
The function y is a morphism from A* into the monoid F. The submonoid y(A*) 
of F is called the transition monoid of the automaton A. This is consistent with the 


terminology for general automata. 
Observe that, setting X = L(A), we have 


ox) =x: (0.4.1) 


Indeed w € y~!y(X) if and only if y(w) € Y(X) which is equivalent to ip(w) € T, ie. 
towe X. 
A morphism y from a monoid M onto a monoid N is said to recognize a subset X of 


M if 
g'y(X) =X. 
A subset X of M is recognizable if it is recognized by a morphism onto a finite monoid. 
Let X be a subset of A*. For w € A*, set 
T'(w) = {(u,v) € A* x A* | uwv E€ X}. 
The syntactic congruence of X is the equivalence relation ox on A* defined by 


w=w' modox 4> I(w) =T(w’). 


It is easily verified that ox is a congruence. The quotient of A* by ox is, by definition, 
the syntactic monoid of X. We denote it by M(X), and we denote by yx the canonical 
morphism from A* onto M(X). 


PROPOSITION 4.3 Let X be a subset of A*, and let py: A* — M be a surjective mor- 


phism. If p recognizes X, then there exists a morphism p from M onto the syntactic 
monoid M(X) such that 


px =o. 
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Proof. It suffices to show that 
p(w) = p(w’) > px (w) = px(w’) (0.4.2) 


2 
Indeed, if (EF holds, then for | Se m E€ M, (m) is defined as the unique 


element in yx(y~!(m)). To show > we consider (u,v) € I'(w). Then uwv € X. 
Thus y(u)y(w)p(v) € Y(X). From y(w) = ¢(w’), it follows that y(u)y(w’)y(v) € 
(X). Since y recognizes X, this implies that uw'v € X. Thus (u,v) € T'(w’). n 


PROPOSITION 4.4 Let X be a subset of A*. The syntactic monoid of X is isomorphic 
to the transition monoid of the minimal automaton A(X). 


Proof. Let M be the transition monoid of E fi he ine A(X) = (Q,i,T) and let 
p: A* Mb the canonical morphism. By ( , the morphism y recognizes X. By 
Proposition oe exists a morphism 7 from M onto the syntactic monoid M(X) 
such that yx = Woy. 

It suffices to show that ~ is injective. For this, consider m, m’ € M such that y(m) = 
w(m’). Let w,w € A* such that y(w) = m, p(w’) = m’. Then x(w) = yx(w’). To 
prove that y(w) = y(w’), we consider a state p E€ Q, and let u € A* be such that 
p=u!X. Then 


pp(w) =p: w = (uw) X = {v € A* | (u,v) € P(w)}. 
Since [(w) = T (w’), we have py(w) = py(w’). Thus y(w) = p(w’), ie. m = m. m 
We now turn to the study of properties which are specific to finite automata. 


THEOREM 4.5 Let X C A*. The following conditions are equivalent. 
(i) The set X is recognized by a finite automaton. 
(ii) The minimal automaton A(X) is finite. 
(iii) The family of sets u-X, for u € A*, is finite. 
(iv) The syntactic monoid M(X) is finite. 
(v) The set X is recognizable. 


E .4.1 
Proof. (i) = (ii). Let A be a finite automaton Hon Bee X. By Proposition fr i we 


can assume that A is deterministic. By Proposition #.2, the minimal automaton A(X) 
also is finite. 

(ii) = (iii) is clear. AS 

(ii) => (iv) holds by Proposition and by the fact that the transition monoid of a 
finite automaton is always finite. 

(iv) = (v) is clear. 

(v) = (i). Let y : A* — M bea morphism onto a finite monoid M, and suppose that 
y recognizes X. Let A = (M,1,y(X)) be the deterministic automaton with transition 
function defined by 

m-a=my(a). 


Then 1-w € (X) if and only if p(w) € Y(X), thus if and only if w € X. Consequently 
LA) =X. 7 
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PROPOSITION 4.6 The family of recognizable subsets of A* is closed under all Boolean 
operations: union, intersection, complementation. 


Proof. Let X,Y C A* be two recognizable subsets of A*. Let A = (P,i,5) and 
B = (Q,j,T) be complete deterministic automata such that X = L(A), Y = L(B). Let 


C = (P x Q, (i,j), R) 


be the complete deterministic automaton defined by 


(p,q): a= (p-a,q-a). 


For R = (SxQ)U(PxT), we have L(C) = XUY. For R= SxT, we have L(C) = XAY. 
Finally, for R= S x (Q \ T), we have L(C) = X \Y. m 


PROPOSITION 4.7 Let a: A* — B* be a morphism. If Y is a recognizable subset of 
B*, then X = a7} (Y) is a recognizable subset of A*. 


Proof. Since Y is recognizable, one has Y = y~!(y(Y), where y is a morphism from B* 
onto a finite monoid M. Setting y = poa: A* — M, it follows that X = ~!(a(X)). 


PROPOSITION 4.8 If x C A* is recognizable, then Y~!X is recognizable for any subset 
Y of A*. 


Proof. One has u~!(Y~!X) = Uyey(yu) 1X. Since X is recognizable, there are finitely 
many sets of the form (yu)~'X, and thus of the form u~!(Y~1!X). This shows that 
Y—!X is recognizable. n 


Consider now a slight generalization of the notion of automaton. An asynchronous 
automaton on A is an automaton A = (Q,1,T), the edges of which may be labeled by 
either a letter or the empty word. Therefore the set of its edges satisfies 


FCQx(AU1)xQ. 


The notions of a path or a successful path extend in a natural way so that the notion 
of the set recognized by the automaton is clear. 


PROPOSITION 4.9 For any finite asynchronous automaton A, there exists a finite au- 
tomaton B such that L(A) = L(B). 


Proof. Let A = (Q,1,T) be an asynchronous automaton. Let B = (Q,1,T) be the 
automaton, the edges of which are the triples (p,a,q) such that there exists a path 
p— qin A. We have 

L{A)N At = L(B) Nn AT. 


If IQT #90, both sets L(A) and L(B) contain the empty word and are therefore 
equal. Othe wise, the sets are equal up to the empty word and the result follows from 
Proposition ae! the set {1} is recognizable. n 


The notion of an asynchronous automaton is useful to prove the following result. 
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PROPOSITION 4.10 If X C A* is recognizable, then X* is recognizable. If X,Y C A* 
are recognizable, then XY is recognizable. 


Proof. Since X* = (X \1)* and since X \ 1 is recognizable, we may suppose that 1 ¢ X. 
Let A = (Q,1,T) be a finite automaton recognizing X. Let F be the set of its edges. 
Since 1 ¢ X, we have IAT = Ø. Let B = (Q,1,T) be the asynchronous automaton 
with edges 

Fu(T & {1} 2). 


Then L(B) = X* . In fact, the inclusion X* \ 1 C L(B) is clear. Conversely, let 
c:i— + j be a successful path in B. By the definition of B, this path has the form 


. w1 1 . we - Wn 
eC: > ty > 19 > t2 ++- tn —> tn 


with i = 24,7 = tn and where no path cy : ik k, tp contains an edge labeled by the 
empty word. Then wy, w2, ..., Wn E€ X and therefore w € XT . 

Now let A = (P, JI, S) and B = (Q, J,T) be two finite automata with sets of edges F 
and G, respectively. Let X = L(A) and let Y = L(B). One may assume that PNQ = 0. 
Let C = (PUQ,I,T) be the asynchronous automaton with edges 


FUG US x41} x J). 
Then L(C) = XY as we may easily check. m 


We shall now give another characterization of recognizable subsets of A*. Let M be 
a monoid. The family of rational subsets of M is the smallest family R of subsets of 
M such that 

(i) any finite subset of M is in R, 
(ii) if X,Y € R, then XUY E R, and XY ER 
(iii) if X € R, then X* ER 


PROPOSITION 4.11 Leta : A* — B* be a morphism. If X is a rational subset of A*, 
then a(X) is a rational subset of B*. 


Proof. The conclusion clearly holds if X is finite, and if it holds for two subsets X; and 
Xə of A*, it holds for their union, their product, and the star. So it holds for every 
rational subset of A*. . 


THEOREM 4.12 Let A be a finite alphabet. A subset of A* is recognizable if and only 
if it is rational. 


Proof. Denote by Rec(A*) the family of recognizable subsets of A* and by Rat(A*) 
that of rational subsets of A*. Let us first prove the inclusion Rat(A*) C Rec( 4y. „p 
act, apy finite subset X of A* is clearly recognizable. Moreover, Propositions an 

show that the family Rec(A*) satisfies conditions (ii) and (iii) of the definition of 
Rat(A*). This proves the inclusion. 

To show that Rec(A*) C Rat(A%*), let us consider a recognizable subset X of A*. 
Let A = (Q,1,T) be a finite automaton recognizing X. Set Q = {1,2,...,n} and for 
1<4i,j <n, 

Xij = {w € A* |i j}. 
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5. Semirings and matrices 


We have 

X = Vier Ujer Xij- 
It is therefore enough to prove that each X; ; is rational. For k € {0,1,...,n}, denote 
by X the set of those w € A* such that there exists a path c : i > j passing only 


through states | < k except perhaps for i,7. In other words we have w € X y if and 
only if w = ajag°-- Gm with 


and 11 <k,...,im—1 < k. We have the formulas 


XO = AUL (0.4.3) 

ao = Xij, (0.4.4) 

K SURO aa ar Wate (0.4.5) 

Since A is finite, X D € Rat(A*) by EES Then ( shows by induction on k > 0 
that x € Rat(A*). Therefore X;,; € Rat(A*) by (ie n 


In the case of an infinite alphabet, a recognizable set needs not to be rational: for 
instance the alphabet itself is recognizable but not rational. However, any recognizable 
set is the inverse image, by a length preserving morphism, of a recognizable set X over 
a finite alphabet. Indeed, this morphism identifies letters with the same image in the 
syntactic monoid of X. The common usage is to call regular a recognizable subset of 
A*. The previous theorem states that regular sets and rational sets are the same for 
finite alphabets. 


COROLLARY 4.13 The family of regular sets over finite alphabets is closed under Bool- 
ean operations, rational operations, morphisms and inverse morphisms, and left and 
right quotient by arbitrary sets. n 


5 Semirings and matrices 


A semiring K is a set equipped with two operations denoted + and - satisfying the 
following axioms: 

(i) The set K is a commutative monoid for + with a neutral element denoted by 0. 
(ii) The set K is a monoid for multiplication with a neutral element denoted by 1. 
(iii) Multiplication is distributive on addition. 

(iv) For al £e K,0-cx=2-0=0. 

Clearly, any ring with unit is a semiring. Other examples of semirings are as follows. 
The set N of natural integers is a semiring and so is the set R+} of nonnegative real 
numbers. 

The Boolean semiring B is composed of two elements 0 and 1. The axioms imply 


0+0=0,04+1=1+0=1, 
0-1=1-0=0-0=0. 
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The semiring B is specified by 
1+1=1. 


The other possibility, viz., 1 + 1 = 0, defines the field Z/2Z. 

More generally, for any integer d > 0, consider the set B(d) = {0,1,...,d+ 1}. It 
becomes a semiring for integer addition and multiplication defined, for i,j € B(d), 
respectively by min(i + j,d + 1) and min(ij,d+ 1). In particular, 6(0) = B. 

For any monoid M, the set P(M) is a semiring for the operations of union and set 
product. 

A semiring K is called ordered if it is given with a partial order < satisfying the 
following properties: 

(i) 0 is the smallest element of K; 

(ii) the following implications hold: 


tIy>r+z<y+z, 
x£ ILY => TZ Lyz, zZz£ťz< zy. 


The semirings 5, N, R4 are ordered by the usual ordering. 

LIYSrt=y+z. 
An ordered semiring is said to be complete if any subset X of K admits a least upper 
bound in K. It is the unique element k of K such that 


G) CX = ask, 
(ii) if x < k for all z € X, then k < k’. 
We denote k = sup(X) or k = sup{z | x € X} or k = supzex(x). Semiring B is 
complete. Semirings N, R+ are not complete. They may be completed according to the 
following general procedure: 
Let K be an ordered semiring whose order is total. Let 


K=KUow. 


where œo ¢ K. The operations of K are extended to K by setting for x € K, 


(i) r+o=w+r=00, 
(ii) if £0, then roo = cor = œ, 
(ii) œœ =œ, 0œ=œ0=0. 


Extending the order of K to K by x < oo for all x € K, the set K becomes a totally 
ordered semiring. It is a complete semiring because any subset has an upper bound 
and therefore also a least upper bound. We define 


N =NUoœ, R+ = R4 U œ 


to be the complete semirings obtained by applying this construction to N and R4. If K 
is a complete semiring, the sum of an infinite family (x;)je7, of elements of K is defined 


by 
Joa = sup{ > zj| JCI,J finite} (0.5.1) 


icI jEJ 
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In the case of semiring R4 this gives the usual notio a summable family: A family 
(x;)ier of elements in R4} is summable if the sum ( ris finite. 
In particular, for a sequence (%7)n>0 of elements of a complete semiring, we have 


`> Tn = sup{ > aS ; (0.5.2) 


since any finite subset of N is contained in some interval {0,1,...,n}. Moreover, if 
I =Uje 7; is a partition of J, then 


Sia = OS Pje (0.5.3) 


i€l jeJ tel; 
Consider, for instance, a mapping 7 : A* — R,. Let X be a subset of A* and let 
Xn = {x€ X ||e| <n}. 


Then 


ye r(x) = sup{ n(x)} : 
rEeX n20 ee 
Let P, Q be two sets and let K be a semiring. A P x Q-matrixz with coefficients in K 
is a Mapping 
m:PxQ—-K. 


We denote indistinctly by 
(p,m, q) or ™p,q 


the value of m on (p,q) € P x Q. We also say that m is a K-relation between P and 
Q. If P = Q, we say that it is a K-relation over Q. The set of all K-relations between 
P and Q is denoted by K?*®. 

Let m € K?X@ bea K-relation between P and Q. For p € P, the row of index p of 
m is denoted by Mpx. It is the element of K 2 defined by 


(Mpx)q = Mpq - 


Similarly, the column of index q of m is denoted by Maq. It is an element of K?. Let 
P,Q, R be three sets and let K be a complete semiring. For m € K?*®@ and n € K@**, 
the product mn is defined as the following element of K?**. Its value on (p,r) € PX R 


is 
(mn)pr = > Mp,qhq,r - 
qEQ 


When P = Q = R, we thus obtain an associative multiplication which turns K@*® 
into a monoid. Its identity is denoted idg or Ig. 

A monoid of K-relations over Q is a submonoid of K@*®. It contains in particular 
the identity idg. 

We now consider properties of nonnegative matrices. Let Q be a set of indices. For 
two @-vectors v,w with real coordinates, one writes v < w if vg < w, for all q € Q and 
v < w if vg < wg for all q E€ Q. A vector v is said to be nonnegative (resp. positive) 
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if v > 0 (resp. v > 0). In the same way, for two Q x Q-matrices M,N with real 
coefficients, one writes M < N when Mp q < Np q for all p,q E€ Q and M < N when 
Mp < Npq for all p,q E€ Q. The Q x Q-matrix M is said to be nonnegative (resp. 
positive ) if M > 0 (resp. M > 0). We shall use often the elementary fact that if 
M > 0 and v È> 0 with v £0, then Mv > 0. 

A nonnegative matrix M is said to be irreducible if for all indices p,q, there is an 
integer k such that M$ q > 0, where M k denotes the k-th power of M. It is easy to 
verify that M is irreducible if and only if (Z + M)” > 0 where n is the dimension of 
M. It is also easy to prove that M is reducible (i. e. M is not irreducible) if there is a 
reordering of the indices such that M is block triangular, i.e. of the form 


(0.5.4) 


TE 


0 W 


with U, W of dimension > 0. 
The following result is part of a theorem known as the Perron-Frobenius theorem. 


THEOREM 5.1 Any nonnegative matrix M has a real eigenvalue pm such that |A| < pm 
for any eigenvalue À of M and there corresponds to pm a nonnegative eigenvector v. 
If M is irreducible, there corresponds to pm a positive eigenvector v. 


eq:reductible 


Proof. Let us first show that one may reduce to the case where M is irreducible. =| es Sdetivie 


if M is reducible, we may consider a triangular decomposition as in Equation 
above. Applying by induction the theorem to U and W, we obtain the result with pm 
equal to the maximal value of the moduli of eigenvalues of U and W. The corresponding 
eigenvector is completed with zeroes (and thus condition that the eigenvector is positive 
fails to hold). We suppose from now on that M is irreducible. For any Q-vector v 4 0, 
let 

ru(v) =min{(Mv);/v; | 1<i<n, vi #0}. 


Thus rj¢(v) is the largest real number r such that Mv > rv. One has rm (Av) = rm (v) 
for any real number A # 0. Moreover, rm is continuous on the set of nonnegative 
vectors. 

The set X of nonnegative vectors v such that ||v|| = 1 is compact. Since a continuous 
function on a compact set reaches its maximum on this set, there is an x € X such that 
ru(x) = pm where py = max{ry(w) | w € X}. Since ry(v) = ru (Av) for A Æ 0, we 
have py = max{ry(w) | w > 0,w Æ 0}. 

We show that Ma = pyx. By the definition of the function rm, we have Max > pmt. 
Set y = Mx—pye. Then y > 0. Assume Mx # pmr. Then y 4 0. Since (I+ M)” > 0, 
this implies that the vector (J + M)"y is positive. But 


(I+ M)"y = (I+ M)"(Mz — pmr) = M(I + M)"2 — pm(I+ M)"¢ = Mz — pz, 


with z = (I + M)"zx. This shows that Mz > pmz, which implies that rm(z) > pm, a 
contradiction with the definition of rm. This shows that pm is an eigenvalue with a 
nonnegative eigenvector. 

Let us show that pm > |A| for each real or complex eigenvalue A of M. Indeed, let 
v be an eigenvector corresponding to A. Then Mv = Av. Let |v| be the nonnegative 
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vector with coordinates |v;|. Then M|v| > Alv| by the triangular inequality. By the 
definition of the function rm, this implies rjy(|v|) > |A| and consequently pm > |A|. 
We have already seen that there corresponds to pm a nonnegative eigenvector x. Let 
us now verify that x > 0. But this is easy since (I+ M)"x = (1+pm)"2, which implies 
that (1+ pm)”x > 0 and thus x > 0. n 


The spectral radius of a matrix is the maximal modulus of its eigenvalues. Thus, the 
above theorem says that the spectral radius of a nonnegative matrix is an eigenvalue. 


is 


1 0 2 
There corresponds to y the nonnegative left eigenvector [yp 1]. 


EXAMPLE 5.1 Let M = k aE The eigenvalues of M are y = LVS and yg! = & 


Let k be an integer. A k-approximate eigenvector of a nonnegative matrix M is, by 
definition, an vector v Æ 0 with integer nonnegative components such that 


Mv < kv. 


Assume that M is the adjacency matrix of some graph G. One useful way to think 
about an approximate eigenvector of M is that it assigns a nonnegative integer weight 
Uq to each vertex q. The vector inequality Mv < kv corresponds to the condition that 
for each vertex p, if we add up the weights of the ends of all edges starting at p, the 
sum should be at most k times the weight of p. 

We will use the following result. Its proof uses the Perron-Frobenius theorem. 


PROPOSITION 5.2 An irreducible nonnegative and integral matrix M with spectral ra- 
dius À admits a positive k-approximate eigenvector if and only if k > À. 


Proof. Suppose first that k > A. Consider the matrix N = kI—M. Since k > A, we have 
det(N) > 0 and thus N is invertible. Moreover, since NT! = (I+M/k+M?/k?+...)/k, 
and since M is irreducible, the matrix N~! is positive. Let v be a column of N~!. We 
have Nv > 0 and thus Mv < kv. Any column of N7! is then a positive k-approximate 
eigenvector of M. 

If k = A, there is by the Perron-Frobenius theorem, a positive vector v such that 
Mv = kv. Since A is an integer, the coefficients of v can be chosen integral. 

Let us finally prove that conversely, if M admits a positive k-approximate eigenvector 
v, then k > A. Consider the matrix N = iM . By the Perron-Frobenius theorem, there 
is a positive vector w such that Nw = w. We have Nv < (k/A)v, thus for all n > 1, 
Nv < (k/A)"v. If A > k, the right hand side tends to 0 as n — oo, thus N” tends to 
the zero matrix, a contradiction with the fact that N”w = w with w > 0. n 


EXAMPLE 5.2 Let M = i 1 . The spectral radius of M is strictly less than 2 and a 


1 0 


F ; . |1 
2-approximate eigenvector is | | ; 
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6 Formal series 


Let A be an alphabet and let K be a semiring. A formal series (or just series) over A 
with coefficients in K is a mapping 


a: A >K. 
The value of ø on w € A* is denoted (a, w). We indifferently denote by 
K* or K(A)) 


the set of formal series over A. We denote by K(A) the set of formal series o € K ((A)) 
such that (o, w) = 0 for all but a finite number of w € A*. An element of K(A) is 
called a polynomial. A series 0 € K((A)) can be extended to a linear function from 
K(A) into K by linearity. For p € K(A), 


(c,p) = X (o,w)(p,w) 


we A* 


This definition makes sense because p is a polynomial. Let o,r € K((A)) and kE K. 
We define the formal series o + 7, or, and ko by 


(o+7,w) =(o,w)+(7,w), (0.6.1) 
(o7,w) = D> (,u)(7, 0), (0.6.2) 
(ko, w) = k(o,w). (0.6.3) 


12 
In (66-9) the sum runs over the 1 + |w| pairs (u,v) such that w = uv. It is therefore 
a finite sum. The set KA} contains two special elements denoted 0 and 1 defined by 


1 ifw=1 


0 otherwise. 


As usya ye denote. on =o0---o (n times) and o? = 1. With the operations defined 
by ( and ( the set K((A)) is a semiring. It may be verified that when K is 
complete K ((A)) is also complete. 


The support of a series o E€ K((A)) is the set 


supp(7) = {w € A* | (o,w) #0}. 


(0, w) = 0, Che) = 


The mapping o + supp(c) is an isomorphism from 6((A)) onto P(A*). 
A family (0;)ier of series is said to be locally finite if for all w € A*, the set 


{i € L| (oi w) 4 O} 


is finite. In this case, a series ø denoted 
o = ò Oi 
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can be defined by 


(o, w) = > (oyu) ; (0.6.4) 


tel 
l ; ‘ 4 . 
This notation makes sense because in the sum pean but a finite number of terms 
are different from 0. We easily check that for a locally finite family (ciJier, 


T (£a) = 5 aap 

iel iel 

Let o € K((A)) be a series with zero constant term, that is such that 
(ol =O; 


Then the family (o”)n>0o is locally finite. In fact, the support of a” does not contain 
words of length less than n. We denote 


g= a”, ot =5 o”. 
n>0 n>1 


Then 
o* =1 +07", oo = 0o0* =o" =” 


PROPOSITION 6.1 Let K be a ring with unit and let o € K((A)) be a series such that 
(o,1) =0. Then 1— o is invertible and 


rae; (0.6.5) 
Proof. We have 
1=o0* — ot = o* -—o*o = o* (1-2). 
Symmetrically, 1 = (1 — a)o*, hence the result. . 


For X C A*, we denote by X the characteristic series of X defined by 


1 ifzeXx, 
(X= TS 
0 otherwise. 


We consider the characteristic series X of X as an element of N((A)). When X = {x} 
we usually write x instead of x. In particular, since the family (£)vex is locally finite, 
we have X = J „ey x. More generally, we have for any series o € K (AJ, 


o= D (o,w)w. 


we A* 
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PROPOSITION 6.2 Let X,Y C A*. Then 


0 ifwg XUY, 
(X+Y,w)=41 ifwe(X\Y)U(Y\X), 
2 ifweXAY. 


In particular, with Z = X UY, 


X+Y =Z ifandonyif XAY =Í. 


Given two sets X,Y C A*, the product XY is said to be unambiguous if any word 


w € XY has only one factorization w = xy with x E€ X,y EY. 


PROPOSITION 6.3 Let X,Y C A*. Then 
(X Y,w) = Card{ (x,y) € X x Y | w = £y}. 


In particular, with Z = XY, 
Z=XY 


if and only if the product XY is unambiguous. 


The following proposition approaches very closely the main subject of this book. 


PROPOSITION 6.4 For X C AT, we have 
((X)*,w) = Card{(z1,..., £n) | n > 0, z; E€ X, W = T122- En}. 


Proof. By the definition of (X )* we have 


7.3 
Applying Proposition Bt we obtain 


((X.)*,w) = Card{(a1,22,..., £p) | £i E X, W = 2x2... £k}. 


-6 
whence Formula (oa 


EXAMPLE 6.1 


AN =(1-A 


t= SO wand (A*A*,w) = 14 |w]. 
we A* 


(0.6.6) [eq0.7.6] 
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We now define the Hadamard product of two series o,r € K((A)) as the series o © T 
given by 
(0 ©7T,w) = (o, w)(7, w). 


This product is distributive over addition, viz. 
cO(t+7)=cOT+00T'. 
If the semiring K satisfies ry = 0 => x = 0 or y = 0, then 
supp(o © T) = supp(c) N supp(rT) . 
In particular, for X,Y C A* and Z=XNY, 
Z=XOY. 


Given two series 0,7 E€ Z((A)) we write o < r when (o, w) < (7, w) for all w € A*. 


7 Multiplicity in automata 


Let A be an alphabet. With each automaton A = (Q,/,T) over A is associated a 
function denoted by ua 


pa: A> NEXS 


defined by 
0 otherwise. 


(p, uala), q) = Ei 


This function extends into a morphism, epoted MA, from A* into the monoid 
N@*® of N-relations over Q (see Section n ae we have 


uall) = Ig 


and for u,v € A* 


(p, palu), q) = X` (p, nalu), r)(r, ual), q). 


rEQ 


The morphism py is called the representation associa ed with 4 The correspondence 
between u4 and the morphism y4 defined in Section Wis given by: 


(p,q) € palw) = (p, ualw), q) £0. 
PROPOSITION 7.1 Let A = (Q,I,T) be an automaton over A. For all p,q E€ Q and 


w € A*, (p, ualw),q) is the (possibly infinite) number of paths from p to q with label 
w. 7 
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A path c: i —> t is called successful if i € I and t € T. The behavior of the automaton 
A= (Q,I,T) is the formal power series |A| defined by 


(Aw) = D G malw),t) (0.7.1) 


ie€l,tEeT 


The set recognized by A is the support of |A|. It is just the set of all labels of successful 
paths. It is denoted by L(|A|), as in Section O 


PROPOSITION 7.2 Let A = (Q,I,T) be an automaton over A. For all w € A*, (|A|, w) 
is the (possibly infinite) number of successful paths labeled by w. n 


a 
A more compact writing of formula B tonsists in 


(LA|, w) = Ina(w)T. (0.7.2) 


Here, the element I € MÈ is considered as a row vector and T € N® as a column 


vector. 
a 
Figure 0.5: The Fibonacci automaton. 401 


1 
EXAMPLE 7.1 Let A be the automaton given by Figure Pat with l=T= {1}. Its 
behavior is the series 
|A] = So fra”, 


n>0 
where fn is the nth Fibonacci number. These numbers are defined by 
fo= fi =1, 
fn+1 = fn + fn-1, n>. 
For n > 1, we have 


pacar) =| fr] 


with f- =0. 


Let A = (Q,I,T) be an automaton. A state q € Q is accessible (resp., coaccessible) 
if there exists a path c : i > q with i € I (resp., a path c : q > t with t € T). An 
automaton is trim if all states are both accessible and coaccessible. The trim part of 
|A| = (Q, I,T) is the automaton 


A? = (P,IQAP,TAP), 


which is the restriction of A to the set P of states of |A| which are both accessible and 
coaccessible. It is easily seen that 


|A]? = A]. 
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PROPOSITION 7.3 Let A = (Q,I,T) be a finite automaton over A. For each integer d, 


the set {w € A* | (A|, w) = d} is regular. 


Proof. Let M be the monoid of Q x Q-matrices over the semiring B(d). For each word 
w, let a(w) be the Q x Q-matrix over B(d) obtained from u 4(w) by replacing each entry 
ualw)p q by min(d+ 1, 4.4(w)p,q)- Since such a replacement is a morphism from M onto 
B(d), the mapping a is a morphism from A* into M. The set {w € A* | (A|, w) = d} is 
recognized by a; it is indeed the set of words w such that Ia(w)T (computed in B(d)) 
equals d. n 


To each automaton A = (Q,I,T), we associate an automaton A* by a canonical 
construction consisting of the two following steps. Let w Z Q be a new state, and let 


B = (QU w,w,w) (0.7.3) 
be the automaton with edges 
G=FUTUTUO, 


where F is the set of edges of A, and 


T = {(w,a,q) | Ji EI: (i,a,q) EF} (0.7.4) 
T = {(¢,a,w) | dt e T: (g,a,t) E F} (0.7.5) 
O = {(w,a,w) | ie I,t eT: (i,a,t) €E F}) (0.7.6) 


By definition, the automaton A* is the trim part of B. 

The following terminology is convenient for automata of the form A = (Q, 1,1) having 
just one initial state which is also the unique final state. 

A path 


w 
c:p—>q 
is called simple if it is not the null path (i.e., w € A*) and if for any factorization 
u v 
c:p—r—q 


of the path c into two nonnull paths, we have r Æ 1. 
Any path c from p to q either is the null path or is simple or decomposes in a unique 


Manner as 
han £2 


> 1 


plete 34 sg, 


cC:p twi 

where each of these n + 2 paths is simple. 
PROPOSITION 7.4 Let X C At, and let A be an automaton such that |A| = X. Then 
at] = (X). (0.7.7) 


3 
Proof. Since A* is the trim part of the automaton B defined by formula HES suffices 
to show that |B] = |A|*. 
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Let S be the power series defined as follows: for all w € A*, (S,w) is the number of 
simple paths from w to w labeled with w. By the preceding remarks, we have 


\B|=S". 
Thus it remains to prove that 
S=xX. 


Let w € A*. If w = 1, then 
(5,1) = (X,1) =0, 


since a simple path is no l. If w =a E A, then (S,a) = 1 if and only ifa € X, 
according to formula ( . Assume now |w| > 2. Set w = aub with a,b € A and 
u € A*. Each simple path c: w —> w factorizes uniquely into 


in A. This path is unique because the behavior of A is a characteristic series. If there 
is another simple path c’ : w —> w in B, then there is also another successful path 
labeled w in A; this is impossible. Thus there is at most one simple path c : w —> w 
in B and such a path exists if and only if w € X. Consequently, S = X , which was to 
be proved. n 


2 
EXAMPLE 7.2 Let X = {a,a?}. Then X = |A| for the automaton given in Figure bro 
with J = {1}, T = {3}. The automaton A* is the automaton of Figure up toa 
renaming of w. Consequently, for n > 0 


Figure 0.6: 


EXAMPLE, 7,3 Let X = {aa, ba, baa, bb, bba}. We have X = |A| for the automaton A 
of Figure 0.7, with J = {1}, T = {4}. The corresponding automaton A* is given in 
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4_03 


Figure 0.8: 4_04 


Let A be an alphabet and let K be a semiring. A finite weighted automaton A = 
(Q,I,T) over the alphabet A and with weights in K is given by a finite set Q with two 
mappings J,7':Q— K and by a mapping 


E:QxAxQ-K. 


If E(p,a,q) = k £0, then we say that (p,a,q) is an edge with label a and weight k and 
F ka F 
we write p — q. If cis the path 


kya, knan 
> 


qi > 75 > An-1 >q 


then its label is z = a1 -+ -an and its weight is |c| = kı +--+ + kn. We write c : p Zq 
for denoting such a path. The behavior of A is the series defined by 


(As) = X I@)lelT(a)- 


zx 
c:p—q 


Since for each x € A*, there are only finitely many paths with label x, the sum is well 
defined. The behavior is also called the series recognized by the weighted automaton. A 
series u is called K-rational if it is the behavior of a weighted automaton with weights 
in the semiring K. We will be particularly interested in N-rational series. 


EXAMPLE 7.4 Any automaton can be viewed as a weighted automaton with weights in 
the Boolean semiring 5, or in the semiring N. In this case, the behavior is the number 
of successful paths. 
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Figure 0.9: A weighted automaton over a single letter alphabet. 


EXAMPLE 7.5 The weighted automaton of Figure fo tas integer weights and a one 
letter alphabet. For simplicity, the letter is not specified, and the weight of an edge is 
not indicated if it is 1. The behavior on the word of length n is n?. 

Indeed, denote by un, Un, Wn the sum of the weights of the paths of length n ending 
in 3 and starting in 1, 2, 3 respectively. We have w, = 1 for all n > 0. Next, the 
form of the automaton shows that Un+1 = Un + Wn for n > 0, whence vn = n. Finally 


Unt1 = Un + 2w — n + wn, and thus un = n? for n > 0. 


PROPOSITION 7.5 For any regular subset X of A*, the series ase Card(X N A™)z” 
is N-rational. 


Proof. Let A be a deterministic finite automaton recognizing X, and let B be the 
weighted automaton obtained by replacing all labels in A by the symbol z. Clearly 6 
recognizes the series „>o Card(X N A”)z”. m 


Let A = (Q, I,T) be a weighted automaton. When J is a singleton, that is I(i) = 1 
for some i € Q, and I(q) = 0 for q 4 i, we write i instead of J. The same convention 
holds for T. 

A weighted automaton A = (Q,i,t) is said to be trim if for each vertex q, there is a 
path from i to q and a path from q to t. It is said to be normalized if in addition no 
edge enters i, no edge leaves t, and i Æ t. Any N-rational series u with zero constant 
term can be recognized by a normalized weighted automaton. 

Given a series u = 0,59 Un2” with integer coefficients and with zero constant term 
uo = 0, we recall that u* denotes the series defined by u*(z) = 1/(1 — u(z)). 


PROPOSITION 7.6 Letu = „>o Unz” be an N-rational series with zero constant term. 
Let A = (Q,i,t) be a normalized weighted automaton recognizing u. Let Q = Q\t and 
let A = (Q,i,1) be the weighted automaton obtained by merging i and t. The behavior 
of A is the series u*. 


Proof. A path from i to i is called simple if it does not go through i inbetween. For 
each n > 0, Un is the sum of the weights of the simple paths of length n from i to i 
in A. Indeed, since A is normalized, to each simple path @: i — i in A corresponds a 
unique path from i to t in A, and conversely. 

Next, for r > 1, let ul”) be the sum of the weights of the paths from i to i that go 
exactly (r — 1) times through i inbetween. Set u\")(z) = 59 ul) 2” and uO (z)=1 
The series u“)(z) = 7,55 u®?) (2) is the behavior of A. 7 
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Next, u") (z) = u(z)” for r > 0. Since u*(z) = JCr>0 U(z)", we obtain u* (z) = ul) (z). 


Observe that this proposition is related to Proposition ea which can be used to give 
an alternative proof. Indeed, if A = (Q,i,t) is a normalized automaton, then, in the 
automaton A%*, state i is no longer accessible and state t is no longer coaccessible. Thus 
the trimmed automaton is identical with A. 


EXAMPLE 7.6 Let ulz) = z + z?. The weighted automaton A with A given on the 
left of Figure HiT recognizes u with i = 1 and t = 3. The weighted automaton A is 


Ce.» aw 


Figure 0.10: A weighted automaton recognizing z + z? and 1/(1 — z — 2°) 


represented on the right side. 


The following statement relates weighted automata with weights in N with nonneg- 
ative matrices. The adjacency matriz of the weighted automaton A = (Q, I, T) is the 
Q x Q matrix defined by 


Mp = D E(p,a,q). 
acA 


where E(p,a,q) is the weight of the edge (p, a, q). 


PROPOSITION 7.7 Let u(z) = $ ` >o Un2” be an N-rational series recognized by a trim 
weighted automaton and let M be the adjacency matrix of A. The radius of convergence 
of the series u(z) is the inverse of the maximal eigenvalue of M. 


Proof. Let À be the xjmal eigenvalue of M, which exists and is positive by the Perron- 
j I. Let p be th 


Frobenius Theorem e radius of convergence of the series u(z) and, for 
each p,q € Q, let pp, be the radius of convergence of the series upq(z) = Xn Mpg” 
Then 1/X = min ppg since >), M"z” converges for |z| < 1/A. Next, since A is trim, 
the series up (z) converges whenever u(z) converges; thus pyq > p for all p,q € Q. On 
the other hand p > min pp, since u is a nonnegative linear combination of the series 
Sp- Thus p = min pp, which concludes the proof. n 


igQ.2 
EXAMPLE 7.7 The weighted automaton A of Figure Pfi recognizes the series u(z) = 
1 — 2? /(1 — 2z?). The radius of convergence of u(z) is v2/2. The adjacency matrix of 
Ais 


0 10 
1 0 1 
0 10 


The eigenvalues are 0 and +v2. 
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Figure 0.11: A weighted automaton recognizing 1 — z?/(1 — 227). 


8 Ideals in a monoid 
Let M be a monoid. A right ideal of M is a nonempty subset R of M such that 
RMCR 


or equivalently such that for all r € R and all m € M, we have rm € R. Since M isa 
monoid, we then have RM = R because M contains a neutral element. A left ideal of 
M is a nonempty subset L of M such that ML C L. A two-sided ideal (also called an 
ideal) is a nonempty subset IJ of M such that 


MIM cl. 


A two-sided ideal is therefore both a left and a right ideal. In particular, M itself is an 
ideal of M. 

If M contains a zero, the set {0} is a two-sided ideal which is contained in any ideal 
of M. 

An ideal I (resp., a left, right ideal) is called minimal if for any ideal J (resp. left, 
right ideal) 

JcI>J=I1. 
If M contains a minimal two-sided ideal, it is unique because any nonempty intersection 
of ideals is again an ideal. If M contains a 0, the set {0} is the minimal two-sided ideal 
of M. An ideal I Æ 0 (resp., a left, right ideal) is then called 0-minimal if for any ideal 
J (resp. left, right ideal) 
JcI=sJ=0lorJ=I1. 


For any m € M, the set 
R=mM 


is a right ideal. It is the smallest right ideal containing m. In the same way, the set 
L = Mm is the smallest left ideal containing m and the set J = MmM is the smallest 
two-sided ideal containing m. 

We now define in a monoid M four equivalence relations L, R, J and H as 


mRm iff mM=m'M, 

mlm iff Mm= Mm’, 

mIm iff MmM = Mm'M, 

mHm iff mM=m'M and Mm=Mm/’. 
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Therefore, we have for instance, mRm’ if and only if there exist u,u’ € M such that 
m=mu, m=m/u'. 


We haveRCI,£L£C7,andH=RNE. 


Figure 0.12: RL = LR 


PROPOSITION 8.1 The two equivalence R and L commute: 


RL=CLR. 
Proof. Let m,n E€ M be such that MRLn. There exists p E€ M such that mRp, pln 
(see Figure . There exist by the definitions, u,u’,v,v’ € M such that p = mu, 


m= pu',n=vp,p=v'n. Set q = um. We then have 
q = vm = v(pu’) = (vp)u! = nu’ ‚n = vp = v(mu) = (vm)u = qu. 
This shows that qRn. Furthermore, we have 
m=pu =(u'n)u = v' (nv') = v'q. 


Since q = vm by the definition of q, we obtain m£Lq. Therefore MLqRn and con- 
sequently m£Rn. This proves the inclusion RL C LR. The proof of the converse 
inclusion is symmetrical. n 


Since R and £ commute, the relation 
D=RL=LR 
is an equivalence relation. We have the inclusions 
HCR,LCDCI. 


The classes of the relation called D-classes, can be represented by a schema called 
an ”egg-box” as in Figure a 

The R-classes are figured by rows and the £-classes by columns. The squares at the 
intersection of an R-class and an £-class are the H-classes. 

We denote by L(m), R(m), D(m), H(m), respectively, the £L, R, D, and H classes of 
an element m € M. We have 


H(m) = R(m) N L(m) and R(m), L(m) c D(m). 
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Figure 0.13: A D-class 


PROPOSITION 8.2 Let M be a monoid. Let m,m’ € M be R-equivalent. Let u,u’ € M 
be such that 


The mappings 
Pu: q> qu, pw: € > qu 


are bijections from L(m) onto L(m') inverse to each other which map an R-classe onto 
itself. 


Proof. We first verify that p, maps L(m) into L(m’). If q € L(m), then Mq = Mm 
and therefore Mqu = Mmu = Mm’. Hence qu = p,(q) is in L(m’). Analogously, py 
maps L(m’) into L(m). 

Let q E L(m) and compu ‘Pu(q). Since q£m, there exist v,v’ € M such that 
q = vm, m = v'q (see Figure . We have 


PulPulq) = quu’ = (um)uu! 


=vu(mu)u' = vmu =vm =q 


This shows that pw pu is the identity on L(m). We show in the same way that pupw is 
the identity on L(m’). 

Finally, since quu’ = q for all q € L(m), the elements q and p,(q) are in the same 
R-class. 


- 5.2 ; a setts 
Propasition Boa has the following consequence which justifies the regular shape of 
Figure fae 


PROPOSITION 8.3 Any two H-classes contained in the same D-class have the same 
cardinality. x 


We now come to the point of locating the idempotents in an ideal. The first result 
describes the H-class of an idempotent. 


PROPOSITION 8.4 Let M be a monoid and let e € M be an idempotent. The H-class 
of e is the group of units of the monoid eMe. 
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Figure 0.14: The reciprocal bijections 


Proof. Let m € H(e). Then, we have for some u, u’,v,v' € M 
e€e=mu,m=eu,e=um,m=v'e. 


Therefore em = e(eu’) = eu’ = m and in the same way me = m. This shows that 
m E€ eMe. Since 
m(eue) = mue = e, (eve)m = evm =e 


the element m is both right and left invertible in M. Hence, m belongs to the group of 
units of eMe. Conversely, ifm € eMe is right and left invertible, we have mu = vm = e 
for some u,v € eMe. Since m = em = me, we obtain mHe. n 


PROPOSITION 8.5 An H-class of a monoid M is a group if and only if it contains an 
idempotent. 
Proof. Let H be an Hzglass pf M. If H contains an idempotent e, then H = H(e) isa 
group by Proposition ~The converse is obvious. m 
PROPOSITION 8.6 Let M be a monoid and m,n € M. Then mn € R(m) A L(n) if and 
only if R(n) N L(m) contains an idempotent. 


Proof. If R(n) N L(m) contains an idempotent e, then 


/ / 
e=nu,n=eu,e=vm,m=ve 


for some u,u’,v,v’ € M. Hence 


mnu = m(nu) = me = (v'eje = ve =m, 
so that mn Rm. We show in the same way that mn£n. Hence mn € R(m) N (p) 
Conversely, if mn € R(m) N L(n), then mnRm and nlmn. By Proposition e€ 
multiplication on the right by n is a bijection from L(m) onto L(mn). Since n € L(mn), 
this implies the existence of e € L(m) such that en = n. Since the multiplication by 
n preserves R-classes, we have additionally e € R(n). Hence there exists u € M such 
that e = nu. Hence 
nunu = enu = nu 


and e = nu is an idempotent in R(n) N L(m). n 
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PROPOSITION 8.7 Let M be a monoid and let D be a D-class of M. The following 
conditions are equivalent. 


(i) D contains an idempotent. 
(ii) Each R-class of D contains an idempotent. 
(iii) Each £L-class of D contains an idempotent. 


Proof. Obviously, only (i) = (ii) requires a proof. Let e € D be an idempotent. Let R 
be an R-cla D. The H-class H = L(e)M R is nonempty. Let n be an element of H 
(See Figure ). Since nLe, there exist v, v’ € M such that 


Let m = ev’. Then mn = e because 
mn = (ev')n = e(u'n) = ee =e. 


Moreover, we have mRe since mn = í and m = ev’. Therefore, e = mn is in R(m) N 


L(n). This implies, by Proposition 8-6, that R = R(n) contains an idempotent. n 
e m 
v Vv 
R n 


Figure 0.15: Finding an idempotent in R 
se Pe an Bte 
A D-class D satisfying one of the conditions of Proposition is Called regular. 


PROPOSITION 8.8 Let M be a monoid and let H be an H-class of M. The two following 
conditions are equivalent. 

(i) There exist h,h’ € H such that hh’ € H. 

(ii) H is a group. 


-5.6 
Proof. (i) => ji). If hh’ € H, then by Proposition Bq? contains an idempotent. By 
Proposition 8.5, it is a group. 
(ii) > (i) is obvious. 7 


We now study the minimal and 0-minimal ideals in a monoid. Recall that if M 
contains a minimal ideal, it is unique. However, it may contain several 0-minimal 
ideals. 

Let M be a monoid containing a zero. We say that M is prime if for any m,n € M —O, 
there exists u € M such that mun 4 0. 


PROPOSITION 8.9 Let M be a prime monoid. 

1. If M contains a 0-minimal ideal, it is unique. 

2. If M contains a 0-minimal right (resp., left) ideal, then M contains a 0-minimal 
ideal; this ideal is the union of all 0-minimal right (resp. left) ideals of M. 

3. IfM both contains a 0-minimal right ideal and a 0-minimal left ideal, its 0O-minimal 
ideal is composed of a regular D-class and zero. 
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Proof. 1. Let I, J be two 0-minimal ideals of M. Let m € I\0 and let n € J\ 0. Since 
M is prime, there exist u € M such that mun 4 0. Then mun € J implies IN J # {0}. 
Since IN J is an ideal, we obtain JN J =I = J. 

2. Let R be a O-minimal right ideal. We first show that for all m € M, either 
mR = {0} or the set mR is a 0-minimal right ideal. In fact, mR is clearly a right 
ideal. Suppose mR # {0} and let R’ # {0} be a right ideal contained in mR. Set 
S={reR|mre R}. Then R’ = mS and S # {0} since R’ # {0}. Moreover, S is 
a right ideal because R’ is a right ideal. Since S C R, the fact that R is a 0-minimal 
right ideal implies the equality S = R. This shows that mR = R’ and consequently 
that mR is a 0-minimal right ideal. 

Let I be the union of all the minimal right ideals. It is a right ideal, and by the 
preceding discussion, it is also a left ideal. Let J 4 {0} be an ideal of M. Then for any 
0-minimal right ideal R of M, 


RICROICR. 


We have RJ # {0} since for any r € R\ 0 and m € J \ 0, there exists u € M such 
that rum 4 {0} whence rum € RJ\ 0. Since R is a 0-minimal right ideal and RN J 
is a right ideal distinct from {0}, we have RN J = R. Hence R C J. This shows that 
I c J. Hence J is contained in any nonzero ideal of M and therefore is the 0-minimal 
ideal of M. 

3. Let I be the 0-minimal ideal of M. Let m,n € I\ 0. By 2, the right ideal mM 
and the left ideal Mn are 0-minimal. Since M is prime, there exists u € M such that 
mun 4 0. The right ideal mM being 0-minimal, we have mM = munM and therefore 
mRmun. In the same way, mun£n. Hence we have mDn. This shows that I \ 0 is 
contained in a D-class. Conversely, if m € I\0,n € M and mDn, there exists ak € M 
such that mM = kM and Mk = Mn. Hence I = MmM = MkM = MnM and this 
implies n € I \ 0. This shows that J \ 0 is a D-class. ae 

Let us show that J \ 0 is a regular D-class. By Proposition ET Tis enough to prove 
that I \ 0 contains an idempotent. Let m,n € I \ 0. 

Since M is prime, there exists u E€ M such that mun Æ 0. Since the right ideal mM 
is 0-minimal and since mun # 0, we have mM = muM = munM. Whence mun € 
R(mu). Symmetrically, since Mn is a 0-minimal left ideal, we have Mn = Mun = se 
Mmun, whence mun € L(n). Therefore mun € R(mu) N L(n) and by Proposition Ro 
this implies that R(n) N L(mu) contains an idempotent. This idempotent belongs to 
the D class of n and therefore to J \ 0. a 


COROLLARY 8.10 Let M be a prime monoid. If M contains a 0-minimal right ideal 
and a 0-minimal left ideal, then M contains a unique 0-minimal ideal I which is the 
union of all the 0-minimal right (resp., left) ideals. This ideal is composed with a regular 
D class and 0. Moreover, we have the following computational rules. 

1. ForméeéI\0andneée M such that mn 4 0, we have 


mRmn. 
2. Forme I\0 andn Ee M such that nm 4 0, we have 


mLnm. 
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3. For any H class H CI \0 we have 


H? = H or H? = {0}. 

Proof. The first group of statements is an easy consequence of Proposition "Tet us 
prove 1. We have mnM C mM. Since mM is a 0-minimal right ideal and mn Æ 0, 
this forces the equality mnM = mM. The proof of 2 is symmetrical. Finally, to prove 
3, let us suppose H? Æ {0}. Let h,h’ € H be such that hh’ 4 0. Then, by 1 and 2, 
hRhh' and KW'Lhh'. Since he and h'Lhh', we have hLhh’. Therefore hh’ € H and H 
is a group by Proposition Ee n 

We finally give the statements corresponding to Proposition ‘ane Corollary Rig 
for minimal ideals instead of 0-minimal ideals. This is of course of interest only in the 
case where the monoid does not have a zero. 


| Propat@abmit | PROPOSITION 8.11 Let M be a monoid. 


1. If M contains a minimal right (resp. left) ideal, then M contains a minimal ideal 
which is the union of all the minimal right (resp. left) ideals. 

2. If M contains a minimal right ideal and a minimal left ideal, its minimal ideal I 
is a D-class. All the H-classes in I are groups. 


Proof. Let 0 be an element that does not belong to M and let Mo = M UO be the 
monoid whose law extends that of M in such a way that 0 is a zero. The monoid Mog 
is prime. 

An ideal I (resp. a right ideal R, a left ideal L) of M is minimal if and only if TU 0 
(resp. RUO, LUO) is a 0-minimal ideal (resp. right ideal, left ideal) of Mo. Moreover 
the restriction to M of the relations R,£,D,H in Mo coincide with the correspo ding 
relations i 5 Tperefore statements 1 and 2 can be deduced from Proposition BYE 
Corollary S i 


COROLLARY 8.12 Let M be a monoid containing a minimal right ideal and a minimal 
left ideal. Then M contains a minimal ideal which is the union of all the minimal right 
(resp. left) ideals. This ideal is a D class and all its H classes are groups. n 


9 Permutation groups 


In this section we give some elementary results and definitions on permutation groups. 
Let G be a group and let H be a subgroup of G. The right cosets of H in G are the 
sets Hg for g € G. The equality Hg = Hg’ holds if and only if gg'™t € H. Hence the 
right cosets of H in G are a partition of G. 

When G is finite, [G : H] denotes the index of H in G. This number is both equal to 
Card(G)/ Card(H) and to the number of right cosets of H in G. 

Let Q be a set. The symmetric group over Q composed of all the permutations of 
Q is denoted by Gg. For Q = {1,2,...,n} we write Gn instead of G{1,2,..n} A 
permutation is written to the right of its argument. Thus for g € Gg and q € Q the 
image of q by g is denoted by qg. 
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A permutation group over Q is any subgroup of Ng. For instance, the alternating 
group over {1,2,...,n}, denoted by 2,, is the permutation group composed of all even 
permutations. 

Let G be a permutation group over Q. The stabilizer of q E€ Q is the subgroup of G 
composed of all permutations of G fixing q, 


H={hE€G|qh=q}. 


A permutation group over Q is called transitive if for all p,q € Q, there exists g € G 
such that pg = q. 


PROPOSITION 9.1 1. Let G be a group and let H be a subgroup of G. Let Q be the set 
of right cosets of H inG. 


Q={Hg|geG}. 
Let p be the mapping from G into Gg defined for g € G and Hk € Q by 


(Hk)p(g) = (kg). 


The mapping p is a morphism from G into Gg and the permutation group (G) is 
transitive. Moreover, the subgroup (H) is the stabilizer of the point H € Q. 

2. Conversely, let G be a transitive permutation group over Q, let q E€ Q and let H 
be the stabilizer of q. 

The mapping y from G into Q defined by 


yY:g => qg 
induces a bijection a from the set of right cosets of H onto Q and for allk € G, g €G, 
a(Hk)g = a(Hkg). 


Proof. 1. The mapping ọ is well defined because Hk = Hk’ implies Hkg = Hk'g. It is 
a morphism since y(1) = 1 and 


(Hk)olg)olg') = (Hkg)p(g') = Hkgg' = (Hk) p(g9') . 
The permutation group y(G) is transitive since for k, k’ € G, we have 
(Hk)p(k-'k’) = HE’. 


Finally, for all h € H, y(h) fixes the coset H and conversely, if y(g) (g € G) fixes H, 
then Hg = H and thus g € H. 

2. Assume that Hg = Hg’. Then gg’! € H, thus qgg'"! = q, showing that qg = qq’. 
Thus y(g) = y(g’). This shows that we can define a function a by setting a(Hg) = ¥(q). 
Since G is transitive, y is surjective and therefore also a is surjective. To show that @ 
is injective, assume that a(Hg) = a(Hg'). Then qg = qg', whence qgg~! = q. Thus 
gg fixes q. Consequently gg’! € H, whence Hg = Ha’. 

The last formula is a direct consequence of the fact that both sides are equal to gkg. 


Let G be a transitive permutation group over a finite set Q. By definition, the degree 
of G is the number Card(Q). 
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PROPOSITION 9.2 Let G be a transitive permutation group over a finite set Q. Let 
q E Q and let H be the stabilizer of q. The degree of G is equal to the index of H in G. 


8.1 
Proof. The function a: Hg + qg of Proposition Baz} is a bijection from the set of 
right cosets of H onto Q. Consequently Card(Q) = [G : H]. n 


Two permutation groups G over Q and G’ over Q’ are called equivalent if there exists 
a bijection a from Q onto Q’ and an isomorphism ọ from G onto G” such that for all 
qEQandgEG, 
a(qg) = a(q)p(g) 


or equivalently, for g' € Q’ and g E€ G, 


q'olg) = alla (q'))g). 


As an example, consider a permutatio rgup G over Q and let H be the stabilizer of 
some qin Q. According to Proposition his group is equivalent to the permutation 
group over the set of right cosets of H obtained by the action of G on the cosets of H. 

Another example concerns any two stabilizers H and H’ of two points q and q’ ina 
transitive permutation group G over Q. Then H and H’ are equivalent. Indeed, since 
G is transitive, there exists g € G such that qg = q’. Then g defines a bijection a from 
Q onto itself by a(p) = pg. The function y : H — H’ given by y(h) = g7thg is an 
isomorphism and for all pE Q, he H, 


a(ph) = a(p)p(h). 


Let G be a transitive permutation group over Q. An imprimitivity equivalence of G is 
an equivalence relation 0 over Q that is stable for the action of G, i.e., such that for all 
gEG, 

p = q mod 0 = pg = qg mod 0. 


The partition associated with an imprimitivity equivalence is called an imprimitivity 
partition. 

Let 0 be an imprimitivity equivalence of G. The action of G on the classes of 0 defines 
a transitive permutation group denoted by Gg called the imprimitivity quotient of G 
for 8. 

For any element q in Q, denote by [q] the equivalence class of q mod 0, and let K; 
be the transitive permutation group over |q] formed by the restrictions to [q] of the 
permutations g that globally fix [q], i.e., such that [q]g = [q]. 

The group K; is the group induced by G on the class [q]. 

We prove that the groups Ky, q € Q all are equivalent. Indeed let q,q' € Q and 
g € G be such that qg = q’. The restriction a of g to fq] is a bijection from fq] onto 
[q']. Clearly, a is injective. It is surjective since if p = q’ mod 6, then pg~! = q mod 0 
and a(pg~') = p. Let y be the isomorphism from K} onto Ky defined for k € Kg 
by p'y(k) = a(a71(p')k). This shows that the groups Kq and Ky are equivalent. In 
particular, all equivalence classes mod 0 have the same number of elements. 

Any of the equivalent transitive permutation groups Ky is called the induced group 
of G on the classes of and is denoted by G°. 
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Let d = Card(Q) be the degree of G, e the degree of Gg, and f the degree of G°. 
Then 
d=ef. 


Indeed, e is the number of classes of 0 and f is the common cardinality of each of the 
classes mod @. 

Let G be a transitive permutation group over Q. Then G is called primitive if the 
only imprimitivity equivalences of G are the equality relation and the universal relation 
over Q. 


PROPOSITION 9.3 Let G be a transitive permutation group over Q. Let q € Q and H 
be the stabilizer of q. Then G is primitive if and only if H is a maximal subgroup of G. 


Proof. Assume first that G is primitive. Let K be a subgroup of G such that H C 
K C G. Consider the family of subsets of Q having the form qKg for g € G. Any 
two of these subsets are either disjoint or identical. Suppose indeed that for some 
k,k’ € K and g,g' € G, we have qkf = qk'g'. Then qkgg’'k'-! = q, showing 
that kgg’'k’-! € H C K. Thus gg’! € K, whence Kg = Kg! and consequently 
qKg = qKq’. Thus the sets qKg form a partition of Q which is clearly an imprimitivity 
partition. Since G is primitive this implies that either qK = {q} or qK = Q. The first 
case means that K = H. In the second case, K = G since for any g € G there is some 
k € K with qk = qg showing that gk~'! € H C K which implies g € K. This proves 
that H is a maximal subgroup. 

Conversely, let H be a maximal subgroup of G and let 0 be an imprimitivity equiv- 
alence of G. Let K be the subgroup 


K = {k € G | qk = q mod 0}. 


Then H C K C G, which implies that K = H or K = G. If K = H, then the class of q 
is reduced to q and @ is therefore reduced to the equality relation. If K = G, then the 
class of q is equal to Q and @ is the universal equivalence. Thus G is primitive. n 


Let G be a transitive permutation group on Q. Then G is said to be regular if 
all elements of G — {1} have no fixed point. It is easily verified that in this case 


Card(G) = Card(Q). 


PROPOSITION 9.4 Let G be a transitive permutation group over Q and letq E Q. The 
group G is regular if and only if the stabilizer of q is a singleton. 


Let k > 1 be an integer. A permutation group G over Q is called k-fold transitive(or 
k-transitive) if for all k-tuples (p1,p2,...,pr) E€ Q® and (q1, q2, ..-, dk) E QË composed 
of distinct elements, there is a g € G such that pig = q1, P29 = q2,- . -, PRI = qk- 

The 1-transitive groups are just the transitive groups. Any k-transitive group for 
k > 2 is clearly also (k — 1) transitive. The group G,, is n-fold transitive. 


PROPOSITION 9.5 Let k > 2 be an integer. A permutation group G over Q is k- 
transitive if and only if G is transitive and if the restriction to the set Q — {q} of the 
stabilizer of q E€ Q is (k — 1)-transitive. 
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Proof. The condition is clearly necessary. Conversely assume that the condition is 
satisfied and let (p1, p2,..., pp) € QF, and (q1,q2,---,d) E QF be k-tuples composed 
of distinct elements. Since G is transitive, there exists a g € G such that pyg = qı. Let 
H be the stabilizer of qı. Since the restriction of H to the set Q — {qi} is (k — 1)-fold 
transitive, there is an h € H such that pogh = qo,...,prgh = qk. Since pigh = q1, the 
permutation g’ = gh satisfies p1g' = q1,pog' = q2,.--., preg’ = qk. This shows that G is 
k transitive. m 


A 2-transitive group is also called doubly transitive. 


pp26rang| PROPOSITION 9.6 A doubly transitive permutation group is primitive. 


Proof Let G be a doubly transitive permutation group over Q and consider an imprim- 
itivity equivalence 0 of G. If 0 is not the equality on Q, then there are two distinct 
elements q,q' E€ Q such that q = q' mod 0. Let q” € Q be distinct from q. Since G 
is 2-fold transitive, there exist g € G such that qg = q and q'g = q”. Since @ is an 
imprimitivity equivalence we have q = q” mod 8. Thus @ is the universal relation on Q. 
This shows that G is primitive. n 


-8.6 
The converse of Proposition Ba is false. Indeed, for any prime number p, the cyclic 
group generated by the permutation (12---p) is pri itive hut is not doubly transitive. 
An interesting case where the converse of Propositie R ue is described in a famous 


theorem of Schur that will be stated in Chapter E 


10 Notes 


Each of the subjects treated in this chapter is part of a theory that we have considered 
only very superficially. A more complete exposition about words can be found in 


E. | 
tion0.4 4.9 
For automata (Section Hj we Toi ow the notation of (E974). Theorem ES 


is due to S. Kleene. 


Our definition of a complete semiring is less general than that of (L974) 

but it will be enou r our purposes. The full statement of the Perron-Frobenius 
=PerronFrobernius | . g en 

theorem (Theorem includes additional statements, including the description of 

the eigenvalues with maximal modulus ( see ((1959)) The function rm is 


sometimes known as the Wielandt function. l ction l E 
Our presentation of ideals in monoids (Section bj is developed with more detail in 


lifford and Preston] (L961) or (L979). The notion of a prime monoj is not 
Bi The 


classical but it is well fitted to the situation that we shall find in Chapter [V]. 
0-minimal ideals of prime monoids are usually called completely 0-simple semigroups. 


For semirings and formal series see (L974) or [Berstel and Reutenauer| (L984). 
A classical textbook on permutation groups is (L964). 
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Codes 


0 Introduction 


The definitions and some important general properties of codes are presented in this 
chapter. The first two se tions contain several equivalent definitions of codes and free 
submonoids. In Section ectiont a a method for verifying that a given set of words 
is a code. A ore appropriate framework to answer this question will be given in 
Chapter Me aytgmata with multiplicities will be considered. 

In Section EPA duce the notion of a Bernoulli distribution over an alphab 
This allows us to give a necessary condition for a set to be a code (Theorem ; 
The questions about probabilities rai ed in this and in the following section will be 
developed in more depth in Chapter Re 

Section O introduces the concept of a complete set. This is in some se e notion 
dual to that of a code. The main result of this chapter (Theorem escribes 
complete codes by using results on Bernoulli distributions developed previously. In the 
last section the operation of composition of codes is introduced and several properties 
of this operation are established. 


1 Definitions 


This section contains the definitions of the notions of code, prefix (suffix, bifix) code, 
maximal code, and coding morphism and gives examples. 
Let A be an alphabet. A subset X of the free monoid A* is a code over A if for all 


n,m > 1 and z1,..., Zn, £1.. , Zia € X, the condition 
LIL- En = 105° Lla (1.1.1) 
implies 
n=m and a,=2, for i=1,...,n (1.1.2) 


In other words, a set X is a code if any word in X* can be written uniquely as a 
product of words in X, that is, has a unique factorization in words in X. Since 1.1 = 1, 
a code never contains the empty word 1. It is clear that any subset of a code is a code. 
In particular, the empty set is a code. 

The definition of a code can be rephrased as follows: 


43 


ta. 
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PROPOSITION 1.1 If a subset X of A* is a code, then any morphism 8 : B* — A* 
which induces a bijection of some alphabet B onto X is injective. Conversely, if there 
exists an injective morphism 3: B* — A* such that X = (B), then X is a code. 


Proof. Let 6: B* — A* be a morphism such that 8 is a bijection of B onto X. Let 
u,v € B* be words such that G(u) = G(v). If u = 1, then v = 1; indeed, 6(b) 4 1 for 
each letter b € B since X is a code. If u #1 and v Æ 1, set u = b1 -+ - bn, v = bi e UL, 
with n,m > 1, bi, ...,bn, b1,- -- bn E B. Since 8 is a morphism, we have 


B(b1):-- B(bn) = B01) Bm) - 


But X is a code and 8(b;), (05) € X. Thus n = m and ((b;) = G(b;) for i = 1,...,n. 
Now £ is injective on B. Thus b; = b; for i = 1,...,n, and u = v. This shows that 8 
is injective. 

Conversely, if 8 : B* — A* is an injective morphism, and if 


E EEE EE A (1.1.3) 


for some n,m > 1 and 7,...,%n,2},..., 2}, € X = E(B), then we consider the letters 
bi, bj in B such that _8(b s Li, B(b,) = = Th, i = 1,...,n, j =1,...,m. Since 8 is 
injective, Equation (i i implies that bı -bn = bi: -bh Thus n = m and b; = Ui, 
whence z; = x; fori=1,...,n. n 


A morphism 3 : B* — A* which is injective and such that X = ((B), is called a 
coding morphism for X. For any code X C A*, the existence of a coding morphism for 
X is straightforward: it suffices to take any bijection of a set B onto X and to extend 


it to a morphi from B* into A 

Proposition tithe origin for the terminology since the words in X encode the let- 
ters of the set B. The coding procedure consists of associating to a word b1b2- - -bn (bj € 
B) which is the text in plain language an encoded or enciphered message (3(b1) --- 3(bn) 
by the use of the coding morphism 3. The fact that 8 is injective ensures that the coded 
text can be deciphered in a unique way to get back the original text. 


EXAMPLE 1.1 For any alphabet A, the set X = A is a code. More generally, if p > 1 
is an integer, then X = A? is a code called the uniform code of words of length p. 
Indeed, if elements of X satisfy Equation aie. the constant length of words in 
X implies the conclusion fst? e 


EXAMPLE 1.2 Over an alphabet A consisting of a single letter a, a nonempty subset 
X C a* is a code if and only if X is a singleton distinct from 1 ( = a®). ° 


EXAMPLE 1.3 The set X = {aa, baa, ba} over A = {a,b} is a code. Indeed, suppose the 
contrary. Then there exists a word w in X*, of minimal length, that has two distinct 


factorizations, 
1 


r es y 
W = T172: En = L13 Em 


(n,m > 1,2;,2, € X). Since w is of minimal length, we have xı # x. Thus zı isa 
proper prefix of x} or vice versa. Assume that xı is a proper prefix of x. By inspection 
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of X, this implies that zı = ba, x = baa. This in turn implies that z2 = aa, x, = aa 
Thus x, = 21a, xx = 22a, and if we assume that ziz): £p = 1122°++Xpa, it 
necessarily follows that rp41 = aa and zp}, = aa. Thus £123: p41 = T172 ++ Lp414. 
But this contradicts the existence of two factorizations. ° 


Figure 1.1: A double factorization starting 


EXAMPLE 1.4 The set X = {a,ab, ba} is not a code since the word w = aba has two 
distinct factorizations 


w = (ab)a = a(ba). e 
Aí 
The following corollary to Proposition i 7 j is useful. 


COROLLARY 1.2 Let a: A* — C* be an injective morphism. If X is a code over A, 
then a(X) is a code over C. If Y is a code over C, then a~! (Y) is a code over A. 


Proof. Let 8 : B* — A* be a coding morphism for X. Then a(5(B)) = a(X) and since 
ao B: B* — C* is an injective morphism, Proposition Eiis that a(X) is a code. 
Conversely, let X = a71 (Y), let n,m > 1, £1,- .- , En, T4,- -, 24, E X be such that 


/ 
Ti Ly = T Lm: 


Then 

a(x1): a(n) = a(21) i ALm): 
Now Y is a code; therefore n = m and a(x;) = a(x;) for i = 1,...,n. The injectivity 
of a implies that z; = z; for i = 1,...,n, showing that X is a code. n 


COROLLARY 1.3 If X C A* is a code, then X” is a code for all integers n > 0. 


Proof. Let 8 : B* — A* be a coding morphism for, X.. Then X” = 6( B”). But B” isa 
code. Thus the conclusion follows from Corollary [-2_ n 


EXAMPLE 1.5 We show that the product of two codes is not a code in general. To 
do this, we consider the sets X = {a, ba} and Y = {a,ab} which are easily seen to be 
codes over the alphabet A = {a,b}. Then 


Z = XY = {aa, aab, baa, baab} . 
The word w = aabaab has two distinct factorizations, 
w = (aa)(baab) = (aab)(aab) . 


Thus Z is not a code. ° 
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An important class of codes is the class of prefix codes to be introduced now. Let 
X be a subset of A*. Then X is a prefix set if no element of X is a proper prefix of 
another element in X. In an equivalent manner, X is prefix if for all z, x’ in X, 


To Se (1.1.4) 


This may be rephrased as: two distinct elements in X are incomparable in the prefix 
ordering. a 

It follows immediately from (Stay ihat a prefix set X containing the empty word 
just consists of the empty word. Suffix sets are defined in a symmetric way. A subset 
X of A* is suffix if no word in X is a proper suffix of another word in X. A set is bifix if 
it is both prefix and suffix. Clearly, a set of words X is suffix if and only if X is prefix. 


PROPOSITION 1.4 Any prefix (suffix, bifix) set of words X # {1} is a code. 


Proof. If X is not a code, then there is a word w of minimal length having two 
factorizations 


1d / $ 
W = tio Ln = TFs Em (zi, £j E€ X). 


Both z1, 2 are nonempty, and since w has minimal length, they are distinct. But then 
zı < x, or x < x contradicting the fact that X is prefix. Thus X is a code. The 
same argument holds for suffix sets. n 


A prefix code (suffix code, bifix code) is a prefix set (suffix, bifix set) which is a code, 
that is distinct from {1} . 


le1.1.5 
EXAMPLE 1.6 Uniform codes are bifix. The sets X and Y of Example i is are a prefix 
and a suffix code. 


EXAMPLE 1.7 The sets X = a*b and Y = {a"b” | n > 1} over A = {a,b} are prefix, 
thus prefix codes. The set Y is suffix, thus bifix, but X is not. This example shows the 
existence of infinite codes over a finite alphabet. 


A code X is maximal over A if X is not properly contained in any other code over 
A, that is, if 
XcX', X' code > X= X’. 


The maximality of a code depends on the alphabet over which it is given. Indeed, if 
X C A* and AG B, then X C B* and X is certainly not maximal over B, even if it 
is a maximal code over A. The definition of a maximal code gives no algorithm that 
allows us to verify that it is sa isfied. However, maximality is decidable, at least for 
recognizable codes (see Section Ff 


EXAMPLE 1.8 Uniform codes A” are maximal over A. Suppose the contrary. Then 
there is a word u € At — A” such that Y = A” U {u} is a code. The word w = u” 
belongs to Y*, and it is also in (A”)* because its length is a multiple of n. Thus 
w = u” = £122°+-Xy for some T1,..., Ljuj E A” . Now u ¢ A”. Thus the two 
factorizations are distinct, Y is not a code and A” is maximal. e 
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PROPOSITION 1.5 Any code X over A is contained in some maximal code over A. 


Proof. Let F be the set of codes over A containing X, ordered by set inclusion. To 
show that F contains a maximal element, it suffices to demonstrate, in view of Zorn’s 
lemma, that any chain C (i.e., any totally ordered subset) in F admits a least upper 
bound in F. 

Consider a chain C of codes containing X; then 


Y =UyecY 


is the least upper bound of C. It remains to show that Y is a code. For this, let 
n,m > 1,Y1,---5Yns Y4- +> Um E Y be such that 


Yio Yn =Yp Yhn. 


Each of the Yi, Vj belongs to a code of the chain C and this determines n + m ele- 
ments (not necessarily distinct) of C. One of them, say Z, contains all the others. Thus 
Yi- -Yn Y1 ---;Ym E Z, and since Z is a code,we have n = m and y; = y; for 
i=1,...,n. This shows that Y is a code. n 


21.5 
Proposition i i is no longer true if we restrict ourselves to finite codes. There exist 
finite codes which are contained ip no nitg maximal code. An example of such a code 
a A K : ectionl. 15. 
will be given in Section xample F 
The fact that a set X C A* is a code admits a very simple expression in the termi- 
nology of formal power series. 


PROPOSITION 1.6 Let X be a subset of At, and let M = X* be the submonoid gener- 
ated by X. Then X is a code if and only if M = (X)* or equivalently M = (1—X)7! 


Proof. According to Proposition fie the coefficient ((X)*, w) of a word w in (X)* is 
equal to the number of distinct factorizations of w in words in X. By definition, X isa 
code if and only if this number takes only the values 0 and 1 for any word in A*. But 
this is equivalent to saying that (X)* is the characteristic series of its support, that is, 
(XW =M. = 


2 Codes and submonoids 


The submonoid X* generated by a code X is sometimes easier to handle than the code 

itself. The fact that X is a code (prefix code, bifix code) is equivalent to the property 

that X* is a free monoid (a right unitary, biunitary monoid). These latter properties 

may be verified directly on the submonoid without any explicit description of its base. 

Thus we can prove that sets are codes by knowing only the submonoid they generate. 
We start with a general property. 


PROPOSITION 2.1 Let A be an alphabet. Any submonoid M of A* has a unique minimal 
set of generators X = (M — 1) — (M — 1}°. 
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Proof. Set Q = M — 1. First, we verify that X generates M, i.e., that X* = M. 
Since X C M, we have X* C M. We prove the opposite inclusion by induction on the 
length of words. Of course, 1 € X*. Let m € Q. If m ¢ Q?, then m € X. Otherwise 
m = mm, with mı, M2 E Q both strictly shorter than m. Therefore m1, M2 belong to 
X* and m € X*. 

Now let Y be a set of generators of M. We may suppose that 1 Y. Then each 
x € X is in Y* and therefore can be written as 


T = Y1Y2' Yn (yi € Y,n > 0) 


The facts that z # 1 and z ¢ Q? force n = 1 and z € Y. This shows that X C Y. 
Thus X is a minimal set of generators and such a set is unique. n 


EXAMPLE 2.1 Let A = {a,b} and let M = {w e€ A* | |wļa = 0 mod 2} Then we 
compute X = (M — 1) — (M — 1)? = bU ab*a. 


We now turn to the study of the submonoid generated by a code. By definition, a 
submonoid M of A* is free if there exists an isomorphism 


a: B* =M 
of a free monoid B* onto M. 


PROPOSITION 2.2 If M is a free submonoid of A*, then its minimal set generators is 
a code. Conversely, if X C A* is a code, then the submonoid X* of A* is free and X 
is its minimal set of generators. 


Proof. Let a : B* — M be an isomorphism, Then qa, considered as morphism from 
B* into A*, is injective. By Proposition Eek set X = a(B) is a code. Next 
M = a(B*) = (a(B))* = X*. Thus X generates M. Furthermore B = B+ — Bt Bt 
and a(Bt+) = M — 1. Consequently X = (M — 1) — (M — 1)?, showing that X is the 
minimal set of generators of M. 

Conversely, assume that X C A* is a code and consider a coding morphism a : 
B* — A* for X. Then a is injective and a is a bijection from B onto X. Thus aa 
bijection from B* onto a(B*) = X*. Consequently X* is free. Now a is bijection, 
thus B = Bt — B* Bt implies X = X* — X+ X*+, showing by Proposition STAA X 
is the minimal set of generators of M. n 


The code X which generates a free submonoid M of A* is called the base of M. 


COROLLARY 2.3 Let X and Y be codes over A. If X* = Y*, then X =Y. 


Hod: f , f . 

EXAMPLE 2 j (continued) The set X is a (bifix) code, thus M is a free submonoid of 

x 
= i . yk -2.2 . . . . 

According to Proposition PA we can distinguish two cases where a set X is not a 
code. First, when X is not the minimal set of generators of M = X*, that is, there 
exists an equality 

TL = %179*** En 
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with 2,2; E€ X and n > 2. Note that despite this fact, M might be free. The other 
case holds whe An is the minimal set of generators, but M is not free (this is the case 
of Example 

We now give a characterization of free submonoids of A* which is intrinsic in the 
sense that it does not rely on the bases. Another slightly different characterization is 
given in the exercises. 

Let M be a monoid. A submonoid N of M is stable (in M) if for all u,v,w € M, 


u,v, uw, wv EN >wEN. (1.2.1) 
The hypotheses of (REi nay be written as 
we NINANN E, 
thus the condition for stability becomes 
N INANNICN 


or simply 
N İNANN!=N (1.2.2) 


since 1 € N and therefore N c NIN AONN!. 
Figure e a pictorial representation of condition Ei when the elements 
u,v, w are words. The membership in N is represented by an arch. 


Figure 1.2: Representation of stability. 


Stable submonoids have interesting properties. As a result, ț ey appear in almost all 
of the chapters in this book. A reason for this is Proposition which yields a remark- 
able characterization of free submonoids of a free monoid. As a practical application, 
the proposition is used to prove that some submonoids are free, and consequently, that 
their bases are codes without knowing the bases. 


PROPOSITION 2.4 A submonoid N of A* is stable if and only if it is free. 


Proof. Assume first that N is stable. Set X = (N — 1) — (N —1)?. To prove that X 
is a code, suppose the contrary. Then there is a word z € N of minimal length having 
two distinct factorizations in words of X, 


Z = T1T2':: En = Y1Y2°°* Ym 


with z1,..., £n, Y1;---, Ym E X. We may suppose |x| < |y;|. Then yı = xıw for some 
nonempty word w. It follows that 


Ti, Y2... Ym, TIW = Yi, WY2':':` Ym = T2'': Tn 
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are all in N. Since N is stable, w is in N. Consequently y1 = zıw Z X, which yields 
the contradiction. Thus X is a code. 
Conversely, assume that N is free and let X be its base. Let u,v,w € A* and suppose 
that 
u,v, uw, wv EN. 


Set 


U= By Ek, WV = Tk41' Gp, 


Uw = Yio Y V =Y Ys. 
with z;, yj in X. The equality u(wv) = (uw)v implies 
Oyo REEL Lp = Y1 YY: Ys - 


Thus r = s and x; = y; (i = 1,...,s) since X is a code. Moreover, l > k because 
juw| > |u|, showing that 


hence w = £k41 -xe E N. Thus N is stable. m 


For a prefix code X, the submonoid X* is free, as we have seen, but has a stronger 
property. Those submonoids which are generated by prefix codes, can also be charac- 
terized by a condition which is independent of the base. 

Let M be a monoid and let N be a submonoid of M. Then N is right unitary (in 
M) if for all u,v € M, 

uuv E N> vEN. 


In a symmetric way, N is left unitary if for all u,v € M, 
uvu E N >vEN. 


The conditions may be rewritten as follows: N is right unitary if and only if NTIN = N, 
and N is left unitary if and only if NNT! = N. 

The submonoid N of M is biunitary if it is both left and right unitary. 

The four properties stable, left unitary, right unitary, and biunitary are of the same 
nature. Their relationships can be summarized as 


stable: NINA NN! = N 


A ®& 
left unitary: NNT = N NTIN = N: right unitary 
~ A 


biunitary: NNT = NIN = N 


2.1 
EXAMPLE E i (continued) The submonoid M is biunitary. Indeed, if u, uv € M then 
lula and |uvla = |ula + [vla are even numbers; consequently |v|_ is even and v € M. 
Thus M is right unitary. 
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EXAMPLE 2.2 In group theory, the concepts stable, unitary and biunitary collapse and 
coincide with the notion of subgroup. Indeed, let H be a stable submonoid of a group 
G. For all h € H, both hh! and h~'h are in H. Stability implies that h~! is in H. 
Thus H is a subgroup. If H is a subgroup, then conversely HH~! = HHH = H, 
showing that H is biunitary. 


The following proposition shows the relationship between the submonoids we defined 
and codes. 


PROPOSITION 2.5 A submonoid M of A* is right unitary (resp., left unitary, biunitary) 
if and only if its minimal set of generators is a prefix code (suffix code, bifix code). In 
particular, a right unitary (left unitary, biunitary) submonoid of A* is free. 


Proof. Let M C A* be a submonoid, Q = M — 1 and let X = Q — Q? be its minimal 
set of generators. Suppose M is right unitary. 

To show that X is prefix, let x, xu be in X for some u € A*. Then z,ru € M and 
thus u € M. If u Æ 1, then u € Q; but then zu € Q? contrary to the assumption. Thus 
u = 1 and X is prefix. 

Conversely, suppose X is prefix. Let u,v € A* be such that u, uv € M = X* Then 


U = Fy En, V= Yr Ym 
for some z1,..., En, Y1,- --, Ym E X. Consequently 
Ty EnY = Y1 Ym. 


Since X is prefix, neither xı nor yı is a proper prefix of the other. Thus zı = yı, and 
for the same reason £2 = Y2,...,£n = Yn. This shows that m > n and v = Yn41 `: Ym 
belongs to M. Thus M is right unitary. m 


Let M be a free submonoid of A*. Then M is maximal if M 4 A* and M is not 
properly contained in any other free submonoid excepted A*. 


PROPOSITION 2.6 If M is a maximal free submonoid of A*, then its base X is a maz- 
imal code. 


Proof. Let Y be a code on A with X $ Y. Then X* C Y* and X* # Y* since 
otherwise X = Y by Corollary me X* is maximal. Thus Y* = A* and Y = A. 
Thus X Ç A. Let b € A—X. The set Z = X Ub? is a code and M & Z* G A*. 
Both inclusions are strict since b? ¢ M and b ¢ Z*. This contradicts the maximality 
of M. n 


Note that the converse of the proposition is false since uniform codes A” (n > 1) are 
maximal. But if k,n > 2, we have (A’")* Ç (A")* © A*, showing that (A"*)* is not 
maximal. 

We now introduce a family of bifix codes called group codes which have interesting 
properties. Before we give the definition, let us consider the following situation. 

Let G be a group, H be a subgroup of G, and 


yp: A >G (1.2.3) 
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be a morphism. The submonoid 
M= yp 4 (H) (1.2.4) 


is biunitary. Indeed, if, for instance, p,pq E€ M, then y(p),y(pq) € H, therefore 
plp) tolpa) = plq) € H and q € M. The same proof shows that M is left unitary. 
Thus the base, say X, of M is a bifix code. à 

The definition of the submonoid M in Eiis equivalent to a description as the 
ein ES pf A* with a subgroup of the free group A® on A. Indeed, the morphism 


y in ( actorizes in a unique way in 


A* x ee 
\ A 
AO 


with 1 the canonical injection. Setting Q = ~~!(H), we have 


M= OMA". 


Conversely if Q is a subgroup of AS and M = QN A*, then 


M =.-1(Q). 
A group cod. js the base X of a submonoid M = y~!(H), where y is a morphism 
given by ( which, moreover, is supposed to be surjective. Then X is a bifix code 


and X is a maximal code. Indeed, if M = A*, then X = A is maximal. Otherwise 
take w € A* — M and setting Y = X Uw, let us verify that Y is not a code. Set 
m = p(w). Since y is surjective, there is word w € A* such that y(w) = m~t. The 
words u = wù, v = ww both are in M, and www = uw = wv € Y*. This word has two 
distinct factorizations in words in Y, namely, uw formed of words in X followed by a 
word in Y, and wv which is composed the other way round. Thus Y is not a code and 
X is maximal. 
We give now three examples of group codes. 


EXAMPLE 2.3 Let A = {a,b} and consider the set 
M = {w € A* | |wļa = 0 mod 2} 
of Example P We have M = y~1(0), where 
yp: A* > Z/2Z 


is the morphism given by y(a) = 1,y(b) = 0. Thus the base of M, namely the code 
X = bUab*a, is a group code, hence maximal. ° 


EXAMPLE 2.4 The uniform code A™ over A is a group code. The monoid (A™)* is 
indeed the kernel of the morphism of A* onto Z/mZ mapping all letters on the number 
1. e 
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EXAMPLE 2.5 Let A = {a,b}, and consider now the submonoid 
{w € A | |wla = |wh} (1.2.5) 
composed of the words on A having as many a’s as b’s. Let 
6:A* 5—Z 
be the morphism defined by ô(a) = 1,6(b) = —1. Clearly 
d(w) = |wla — [wl 


.5 
for all w € A*. Thus the set (Reais equal to d~!(0). The base of 5~!(0) is denoted 
by D or Dı, the submonoid itself by D* or D}. Words in D are called Dyck-primes, D 
is the Dyck code over A. The set D* is the Dyck set over A. ° 


EXAMPLE 2.6 More generally, let A = BU B (BN B = Ú) be an alphabet with 2n 
letters, and let 6 : A* — B® be the morphism of A* onto the free group B® defined by 
6(b) = b, 6(b) = b+ for b € B,b € B. The base of the submonoid 6~'(1) is denoted by 
Dry, and is called the Dyck code over A or over n letters. ° 


We now turn to a slightly different topic and consider the free submonoids of A* 
containing a given submon iq. , We start with the following observation which easily 
follows from Proposition P.A ~ 


PROPOSITION 2.7 The intersection of an arbitrary family of free submonoids of A* is 
a free submonoid. 


Proof. Let (M;)ier be a family of free submonoids of A*, and set M = NierMi. Clearly 
M is a submonoid, and it suffices to show that M is stable. If 


u, vw, uv, w E€ M 


then these four words belong to each of the M;. Each M; being stable, w is in M; for 
each i € I. Thus w € M. n 


Proposition Ett teads to the following considerations. Let X be a subset of A*. As 
we have just seen, the intersection of all free submonoids of A* containing X is again 
a free submonoid. It is the smallest free submonoid of A* containing X. We call it the 
free hull of X. If X* is a free submonoid, then it coincides of course with its free hull. 

Let X be a subset of A*, let N be its free hull and let Y be the base of N. If X is 
not a code, then X Æ Y. The following result, known as the defect theorem gives an 
interesting relationship between X and Y. 


THEOREM 2.8 Let X be a subset of A*, and let Y be the base of the free hull of X. If 
X is not a code, then 
Card(Y) < Card(X) — 1. 


The followi resylt is a consequence of the theorem. It can be proved directly as 
well (Exercise ate 
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COROLLARY 2.9 Let X = {21,22}. Then X is not a code if and only if xı and zə are 
powers of the same word: xı = y”,x2 = y1 for some y € At, p,q > 0. 


Note that this corollary entirely describes the codes with two elements. The case of 
sets with three words is already much more complicated. 
For the proof of Theorem 2.8, we first show the following result. 


PROPOSITION 2.10 Let X C A* and let Y be the base of the free hull of X. Then 
Y c XY AG) x, 


that is each word in Y appears as the first (resp., last) factor in the factorization of 
some word x E€ X in words belonging to Y. 


Proof. Suppose that a word y € Y is not in (Y*)~'X. Then X C 1U Y*(Y — y). 
Setting 

Z =y*(Y - y) 
we have Zt = Y*(Y — y), thus X C Z*. Now Z* is free. Indeed, any word z € Z* has 


a unique factorization 


Z = Y1Y2* Yn, Yi- -Yn EY, Yn FY 
and therefore can be written uniquely as 
z = yP zy? zg +s yP zp, Riga ep EY — y, p= 0. 


Now X C Z* Q Y*, showing that Y* is not the free hull of X. This yields the 
contradiction. 7 


Proof of Theorem ES FX contains the empty word, then X and X’ = X — 1 have 
same free hull Y*. If the result holds for X’, it also holds for X, since if X’ is a code, 
then Y = X’ and Card(Y) = Card(X) — 1, and otherwise Card(Y) < Card(X’) —1< 
Card(X) — 2. Thus we may assume that 1 Z X. Let a: X — Y be the mapping 
defined by 
alxz)=y if weyY”. 

This mapping is uniquely defined si ce Y is a code; it is everywhere defined since 
X c Y*. In view of Proposition Tae function qa is surjective. If X is not a code, 
then there exists a relation 


LIL: En = TILS 2, Ti, T) EX (1.2.6) 
6 
with zı 4 x. However, Y is a code, and by (Baa Wve have 
a(r1) = a(z). 


Thus a is not injective. This proves the inequality. n 
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3 A test for codes 


It is not always easy to verify that a given set of words is a code. The test described 
in this section is not based on any new property of codes but consists merely in a 
systematic organization of the computations required to verify that a set of words 
satisfies the definition of a code. 

In the case where X is finite, or more generally if X is recognizable, the amount 
of computation is finite. In other words, it is effectively decidable whether a finite 
(recognizable) set is a code. 

Before starting the description of the algorithm, let us consider an example. 


EXAMPLE 3.1 Let A = {a,b}, and X = {b, abb, abbba, bbba, baabb}. This set is not a 
code. For instance, the word 
w = abbbabbbaabb 


has two factorizations (see Fig. 
w = (abbba)(bbba) (abb) = (abb) (b) (abb) (baabb) . 


These two factorizations define a sequence of prefixes of w, each one corresponding to 
an attempt at a double factorization. We give this list, together with the attempt at a 
double factorization: 


(abbba) = (abb)ba 
(abbba) = (abb) (b)a 
(abbba)bb = (abb) (b) (abb) 
(abbba)(bbba) = (abb)(b)(abb) ba 
(abbba)(bbba)abb = (abb)(b)(abb)(baabb) 


(abbba)(bbba)(abb) = (abb)(b)(abb) (baabb) 


Each but the last one of these attempts fails, because of the rest corresponding to the 
italicized suffix, which remains after the factorization. ° 


Figure 1.3: Two factorizations 1_03 


The algorithm presented here computes all the remainders in all attempts at a double 
factorization. It discovers a double factorization by the fact that the empty word is one 
of the remainders. 

Formally, the computations are organized as follows. Let X be a subset of A+, and 
let 


Wek x1 


Ugg = X U UU X (n21) (1.3.1) 


Then we have the following result: 
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THEOREM 3.1 The set X C A* is a code if and only if none of the sets U, defined 
above contains the empty word. 


3.1 
EXAMPLE 2 j (continued) The word ba is in Uj, next a € U2, then bb € U3 and ba 4 
finally abb € Us and since 1 S U6» the set X is not a code, according to Theorem 
The proof of Theorem 1s based on the following lemma. 


3.1 


Ae) 
LEMMA 3.2 Let X C A* and let (Un)n>1 be defined by (Sa with Uı an arbitrary 
subset of A*. For alln,k with 1 < k < n, one has 1 € Un if and only if there exist a 
word u E€ Ug and integers i,j > 0 such that 


uX’ nN XÍ #0, i+j+k=n. (1.3.2) 


Proof. We prove the statement for all n by EEPE induction on k. ot i j 
k= n. If 1 € Un, taking u = 1, i = j = 0, Eq. ( 1s satisfied. Conversely, if ( 
holds, then i = j = 0. This implies u = 1 and consequently 1 € Uy. 

Now let n > k > 1, and suppose that the equivalence holds for n,n —1,...,4 +1. If 
1 € Un, then by induction hypothesis, there exists v € Uz41 and two integers i,j > 0 
such that 


uX' N XÍ #0, i+j+k+1=n. 


Thus there are words z € Xt, y € XÍ such that vx = y. Now v € Ug41, and there are 
two cases. Either there is a word z € X such that 


zv = u € Uķk 


or there exist z E€ X, u € Ux such that 


Figure 1.4: The two cases 1_04 
In the first case [see Fig. Iia), one has ux = zy, thus 
uX’ N XIT £, u€ Uz. 
In the second case [see Fig. IE), one has zz = uvg = uy, thus 
uXI n XH p, ue Xp. 
In both cases, Formula (Sais satisfied. 
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Conversely, assume that there are u € Ux and i,j > 0 with 
uX’ N XÍ £0, i+j+k=n. 
Then 
UTIT2''' Li = Y1Y2 `: Yj 


for some £r, Ys in X. One has 7 > 1 since if j = 0, then 7 = 0 and k = n, contrary to 
the assumption k < n. Once more, we distinguish two cases, according to the length 
of u compared to the length of yı. 

If u = yv for some v € A*, then v € XIU, C Uk+1 and further 


Thus vX’ N XJ-! Æ @ and by the induction hypothesis 1 € Un. 
If yı = uv for some v € At, then v € Ux C Uk+ı and 


TIT2':' Ti = VY2''' Yj, 


showing that X’ N vXJ-! Æ Ø. Thus again 1 € Up by the induction hypothesis. This 
concludes the proof. n 


The lemma will be used with U1 = X~1X \1 in the next proof, and with other values 
in later chapters. al l 
Proof of Theorem Bayarx is not a code, then there is a relation 


T12: Ep = Y1Y2 Yq, MH YZ EX, MFM. 
Assume |y1| < |x1|. Then zı = y,u for some u € A*. But then 
u€U, and uX? tn xe! SO. 


According to the lemma, 1 € Up+q-1- 

Conversely, if 1 € Un, take k = 1 in the lemma. There exists u € U; and integers 
i,j > 0, such that uX’ N XJ Æ Ø. Now u € U, implies that ru = y for some x,y € X. 
Furthermore x Æ y since u Æ 1. It follows from ruX*N sX’ Æ Ø that yX*n«X/ £90, 
showing that X is not a code. This establishes the theorem. n 


7 3.3 3.1 | : f 
Proposition PF stiows that Theorem provides an algorithm for testing whether a 
recognizable set is a code. 


PROPOSITION 3.3 If X C A* is a recognizable set, then the set of all Un (n > 0) 
is. finite. 


This statement is straightforward for a finite set X since each Up is composed of 
suffixes of words in X. 
Proof. Let o be the syntactic congruence of X, defined by 


w = w’ moda iff (Vu,v € A* uwv € X & uwv Ee X). 
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Let u be the congruence of A* with the two classes {1} and A*. Let = o Nu. We use 
the following general fact. 

If L C A* is a union of equivalence classes of a congruence 0, then for any subset 
Y of A*, Y~!L is a union of congruence classes mod 0. (Indeed, let z € YTIL and 
z' = z mod 0. Then yz € L for some y € Y, whence yz’ € L. Thus z’ € YIL). 

We prove that each Up is a union of equivalence classes of 1 by induction on n > 1. 
For n = 1, X is a union of classes of ø, thus X~1X also is a union of classes for ø, and 
finally X~!X — 1 is a union of classes of 1. Next, if U» is a union of classes of 1, then 
by the previous fact both U;1X and X~!U,, are unions of classes of 1. Thus Un+1 is 
a union of classes of v. The fact that X is recognizable implies that ų has finite index. 
The result follows. a 


3.4 
EXAMPLE 2 i (continued) For X = {b, abb, abbba, bbba, baabb}, we obtain 


U, = {ba, bba, aabb}, X-!U, = {a, ba}, UTX = {abb} 
Uy = {a, ba, abb}, XU, = {a,1}, U; 'X = {bb, bba, abb, 1, ba} 


Thus 1 € U3 and X is not a code. 
EXAMPLE 3.2 Let X = {a,ab,ba} and A = {a,b}. We have 

Uı = {b},  U2={a}, U3={1,b}, Us=X,  Us5=U 
The set U3 contains the empty word. Thus X is not a code. 


EXAMPLE 3.3 Let X = {aa, ba, bb, baa, bba} and A = {a,b}. We obtain U; = {a}, 
U> = U,. Thus Un = {a} for all n > 1 and X is a code. 


If X C At is prefix (thus a code), then U, = X-!X — 1 = Ø. Thus the algorithm 
ends immediately for such codes. On the other hand, if X is suffix, the algorithm does 
not stop after one step. This asymmetrical behavior is due of course to the definition 
of the U, which favors computing from left to right. 


4 Unambiguous automata 


An automaton A = (Q,/,T) over A is unambiguous if for all p,q E€ Q and w E€ A*, 
there is at most one path from p to q with label w in A. 


PROPOSITION 4.1 Let A = (Q,i,t) be a trim automaton with a unique initial and a 
unique final state. Then A is unambiguous if and only if |A| is a characteristic series. 


Proof. If A is unambiguous, then clearly |A| is a characteristic series. Conversely, if 
there are two distinct paths from p to q labeled with w for some p,q € Q and w € A’, 
then choosing paths i — p and q —> t, we have 


(A|, uwv) > 2. : 


If A is an unambiguous automaton, we usually identify the series |A| with the set L(A) 
recognized by A. 
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PROPOSITION 4.2 Let X C A* and let A be an automaton such that |A| = X. Then 
X is a code if and only if A* is unambiguous. 


1.4 
Proof. According to Proposition E: we have |A*| = (X )*. Since A* is trim, Propo- 
sition ‘shows that A* is unambiguous if and only if |A*| is a characteristic series. 
Since L(A*) = X*, this means that A* is peat" if and only if X * = (X_)*. Thus 


we get the proposition from Proposition fi. n 


In view of Proposition awe can determine whether a set X given by an unambigu- 
ous automaton A is a code, by computing A* and testing whether A* is unambiguous. 
For doing this, we may use the following method. 

Let A = (Q,/,T) be an automaton over A. Define the square of A 


S(A)=(OxQ,1 x 1,7 xT) 


by defining 
(p1,p2) — (41, 42) 
to be an edge of S(A) if and only if 


a a 
Pı —> q and = pz —> q@ 
are edges of A. 


PROPOSITION 4.3 An automaton A = (Q,I,T) is unambiguous if and only if there is 
no path in S(A) of the form 


(pp) > (r, 8) > (qa) (1.4.1) 

with r Æ s. 
8 
Proof.The existence of a path of the form in S(A) is equivalent to the existence 
of the pair of paths 
u vU U VU 
p—r—q and p — s —q 

with the same label uv in A. m 


To decide whether a recognizable set X (given by an unambiguous finite automaton) 
is a code, it suffices to compute A* and to test whether A* is un higuous by inspecting 
the finite automaton S(A*), looking for paths of the form finite 


EXAMPLE 4.1 Consider, again, the automaton A* of Example [0} 
convenience in Figure one automaton S(A*) is given in Fig. [-6f where only the 
part accessible from the states (q,q) is drawn. It shows that A* is unambiguous. 


1.5 
The following proposition is the converse of Proposition E 
PROPOSITION 4.4 Let A = (Q,1,1) be an unambiguous automaton over A with a single 
initial and final state. Then its behavior |A| is the characteristic series of some free 


submonoid of A*. 
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4 
Figure 1.6: Part of the square of the automaton of Fig. E 4_05 


Proof. Let M C A* be such that |A| = M. Clearly the set M is a submonoid of A*. 
We shall prove that M is a stable submonoid. For this, suppose that 


u, wv, uw,v E M. 


Then there exist in A paths 


1 — 1, 1 — 1, 1— 1, 1 — 1 
The two middle paths factorize as 
1 => p= 1, 1 = ge 


for some p,q € Q. Thus there exist two paths 


1 aa | “op Lf 
1 “sq 2,151. 


Since A is unambiguous, these paths coinci & whence 1 = p = q. Consequently w € M. 
Thus M is stable, and by Proposition M is free. n 


We now prove a result about the determinant of a matrix which is associated in a 
natural way with an automaton. For this, we introduce some terminology. 
For w € A*, we denote by a(w) the commutative image of w which is the monomial 


a(w) = II qlvla 


acA 
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in the commuting variable a € A. More generally, a(o) is the commutative image of 
the formal series ø. It is defined for any monomial t in commuting variables by 


(a(c),t)= $ (ow). 


a(w)=t 


For a subset X C A* we write a(X_) = X. We denote by K[A] and K[[A]] the sets of 
(commutative) polynomials and series with variables in A and with coefficients in the 
semiring K. bisi 

Formula (EEJ Bives an expression of the polynomial 1 — X for a finite code X. 
PROPOSITION 4.5 Let X C At be a finite code and let A = (Q,1,1) be a finite unam- 
biguous trim automaton recognizing X*. Let M be the Q x Q-matrix with elements in 
QA] such that Mpq is the sum of the elements of the set 


Ap = {a E A| p= q}. 


Then 
1— X =det(I — M). (1.4.2) 


Proof. Any path q —> q with q 4 1 and w € At passes through state 1. Otherwise 
uw*v C X for words u,v such that 1 — q —> 1, contradicting the finiteness of X. 
Thus we can set Q = {1,2,...,n} in such a way that whenever i > j fora € A, j #1, 
then i < j . Define for i,j € Q, an element of Q(A) by 


Tij = Oi = Aij (1.4.3) 
where 6;; is the Kronecker symbol. Let A be the polynomial 


A= `> (11) rr e226 -Tn no, 
oEGn 


where e(o) = +1 denotes the signature of the permutation ø. According to the well- 
known formula for determinants we have 


det(I — M) = a(A). 
Thus it suffices to show that 
A=1-X. (1.4.4) 
For this, let 


Ao = T1 1oT2,20 » - - Tn,no, 


A= Y (RA 


oe Gn 
Consider a permutation o € G,, such that A, #0, I Ep then it has at least one 
cycle (iiiz... ip) of length k > 2. Since A, # 0, by ( e sets Avis) Áisizs =- , Bay 
are nonempty. This implies that the cycle (i1 ...,ig) contains state 1. Consequently 


so that 
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each permutation o with A, Æ 0 is composed of fixed points and of one cycle containing 
1. If this cycle is (i1,22,...,%%) with i1 = 1, then 

L<ig<...< tp 


by the choice of the ordering of states in A. Set Xo = Aj, Aini,...Aj,1. Then 
As = (=1)* X, and also (=1)) = C1)". 
The set Xo is composed of words aja2...a, with a; E€ A and such that 


1 > 19 > 13 bores > ik >1. 


These words are in X. Denote by S the set of permutations ø € G — 1 having just one 
nontrivial cycle, namely, the cycle containing 1. Then X = J ses X o since each word 


in X is the label of a unique path (1,%2,--- ,2%,1) with 1 < ig < --- < ig. It follows 
that 
A=14+)-(-1I)A, =1-) > X,=1-X. 1" 
ocs oES 


ig4.1.4 
EXAMPLE 4.2 Let X = {aa, ba, bb, baa, bba}. The automaton given in Figure is TeC- 
ognizes X*. The matrix M is here 


0 a b 
M = a 0 0 
a+b a+b 0 


and one easily checks that indeed 1 — det(M) = 1 — a(X ). 


5 Measure of a code 


In this section, we give a precise formulation to the idea that a code has only few words, 
by introducing the notion of the measure of a code. 
A Bernoulli distribution on A* is a function 


t: A* — R4, 


which is a morphism into the multiplicative monoid R, of nonnegative real numbers, 
and which moreover satisfies 
5 aa) = Le 


acA 
A distribution is positive if r(a) > 0 for all a € A. It follows from the definition that 


and that 


Indeed, for u € A”, we have 


bp n(ua) = 1(u) `> nla) =r(u), 


acA acA 
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thus 


`> n(u) = `> n(v) X x(a) = `> n(v). 


uEAntl vEAn acA vEeAn 


Thus 7 defines a probability distribution on each set A”. We extend m to $B(A*) by 
setting for L C A* 


Then m becomes a function 
mt: P(A*) — RU {+00} 
and satisfies the following properties: 
m(L)>0 foral Lc A*. m(0) =0, 
For any family (L;)ier, of subsets of A* 
z(U Li) < So r(Li), (1.5.1) 
ie ier 
and if the sets L; are pairwise disjoint, then 
m(U Li)) =Y a(Li). (1.5.2) 
ie icI 


The value 7(L) is the measure of the set L relative to 7. It is a nonnegative number 
or +00. 

There is a special case of these formulas which is useful for the computation of the 
measure of a set. 

Let L C A*, and for n > 0 set 


Sn = T{w EL ||w| <n}. 


Then the following formula holds in R: 


m(L) = sup Sp. (1.5.3) 


n>0 
EXAMPLE 5.1 Let A be a finite nonempty set, and 
j= eA 
rla) = —— . 
a Card(A)’ s 


Then 7 defines a Bernoulli distribution on A* called the uniform distribution. 


EXAMPLE 5.2 Let A = {a,b} and X = {a, ba, bb}. Let m : A* — R+ be a distribution. 
Setting as usual 


m(a) = p, m(b)=q=1-p, 
we have 
mX)=ptpqt+@=ptpqt+(l—-p)g=pt+q=l. 
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If L, M are subsets of A*, then 


LM =|) UL {en}. (1.5.4) 


LEL mEM 


It follows from ihat 
mLM) < X` X w(é)r(m) = So (4H) X` n(m) =xr(Lyr(M). (1.5.5) 


LEL mEM LEL mEM 


.5 4 
The inequality in (fg nay be sat jf the uni nin Eiis not disjoint. 


For any subset X of A*, Eqs. and ( “imply that 


MASY ay ar (1.5.6) 


n>0 


PROPOSITION 5.1 Let X C At and let m be a Bernoulli distribution on A*. 
1. If X is a code, then 


n(X”)=r(X)” for n>1 (1.5.7) 


and in particular 1(X*) < œ if and only if m(X) <1. 
2. Conversely, if m is positive, if n(X) is finite, and if 


m(X”")=n(X)” forall n>1, (1.5.8) 
then X is a code. 


Proof. For n > 1, let Sn be the n-fold Cartesian product of X: Sn = X x X x.x X. 
(1) Assume that X is a code. Then the function 


(£1,.-., En) > T1: Ly 


is a bijection of S, onto X”. Consequently 


a= J maene) 9 a(x) = naX": 


(1,.-,%n)ESn rEXnr 


T 
This shows Ei Next the sets (X"),>0 are pairwise disjoint. Thus 


m(X*) = So aA") 


n>0 


A 
This together with proves the second equality. The last one is an immediate 
consequence. 
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(2) Assume that X is not a code. Then there is a word u € X* having two distinct 
factorizations in words in X, say 


/ od / 
U = TIL: Lp = L{X_°++ Ly, 


with n,m > 1 and zi, xi € X. The word w = uu has two factorizations: 


/ 


W = LIL Aye ty HP LEIL En 


of the same length k = n + m. It follows that 


mX)F = XO  n(y yk) = a(X")+ rlw). 
(Y15-Yk)ESk 
8 
But the finiteness of 7(X) and condition (5-3 imply m™(w) <0, hence z(w) = 0. This 
contradicts the hypothesis that m is positive. n 


THEOREM 5.2 If X is a code over A, then n(X) < 1 for all Bernoulli distributions m 
on A*. 


In the case where the alphabet A is finite and where the distribution 7 is uniform, 
we obtain 


COROLLARY 5.3 Let X be a code over an alphabet with k letters; then 
` kalel <1. 7 
rex 


Proof of Theorem “Ate prove the statement first in the case where the numbers |z], 
for x € X, are bounded. (This is the case of a finite code, but a code satisfying the 
condition is not necessarily finite when the alphabet is infinite.) Thus we assume that 
there is some integer k > 1 with 


XC AU A*U:-UAF, 
It follows that for n > 1, X” C AU A?U--- A”, hence 


TX) < nk. (1.5.9) 


Arguing by bebe Ea gssume now that m(X) > 1, that is, r(X : = 4 € for some 


c > 0. Then by Eq. ( , tr(X”) = (1+ 6)” for n > 1. In view of ( , we have for 
aln>1 
(1+e€)” < kn 


which is impossible. Thus 7(X) < 1. 
If X is an arbitrary code, set for n > 1 


Xn = {xE KX | le) <n}. 
The set X, is a code satisfying the conditions of the first part of the proof. Consequently 


n(Xn) <1. 
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3 
This shows, in view of Eq. (3h that 


a LX) = supr(Xn) <1. m 
n>1 


EXAMPLE 5.3 Let A = {a,b}, and X = {b, ab, ba}. Define m by (a) = 1/3, 7(b) = 2/3. 


anes 2 2 2 10 
mX)=s+-4+-= 


3°9'9 9 
thus X is not a code. Note that for 7(a) = 7(b) = 1/2, we get 7(X) = 1. Thus it is 
impossible to conclude that X is not a code from the second distribution. ° 
-4.2 
The following example shows that the converse of Theorem is false. 


EXAMPLE 5.4 Let A = {a,b}, and X = {ab, aba, aab}. The set X is not a code since 
(aba)(ab) = (ab) (aab). 


However, any Bernoulli distribution 7 gives 7(X) < 1. Indeed, let m(a) = p, 7(b) = q. 
Then 
= 2 
™(X) = pq + 2p°q. 


It is easily seen that we always have pq < ; and also p?q < a, provided that p+q= 1. 


Consequently 


1 8 
De e eT 
masg ta j 


-4.2 
This example gives a good illustration of the limits of Theore jp its use for 
testing whether a set is a code. Indeed, es geh X of Example | where the test 


fails, is obtained from the set of Example 6.3, where the test is successful, simply by 
replacing b by ab. This shows that the counting argument represented by a Bernoulli 
distribution takes i tp account the lengths as well as the number of words. In other 
words, Theorem Bose ws us to conclude that X is not a code only if there are “too 
many too short words.” 

Proposition Batis very useful for proving that a code is maximal. The direct method 
for proving maximality, based on the definition, indeed is usually much more com- 
plicated than the verification of the conditions of the proposition. A more precise 
statement, „olding for a large class of codes, will be given in the next section (Theo- 
rem 


PROPOSITION 5.4 Let X be a code over A. If there exists a positive Bernoulli distri- 
bution n on A* such that nr(X) = 1, then the code X is maximal. 


Proof. Assume that X is not maxi al. Then there is some word y ¢ X such that 
Y = X Uy is a code. By Theorem þp.% we have 7(Y) < 1. On the other hand, 


mY) =7(X)+r(y)=1+7(y). 
Thus z(y) = 0, which is impossible since 7 is positive. m 


4.2 
EXAMPLE p3 (continued) Since 7(X) = 1 and X is prefix, X is a maximal code. 
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2.5 
EXAMPLE 5.5 We examine again the Dyck code D over A = {a,b} of Example Est 
Let m : A* — R, be a positive Bernoulli distribution, and let 


m(a) =p, 7(b) =q 
dn = T(DN A"), d® =2(D* NA"), (n>). 


In view of computing the measure of D, consider the formal power series 


We have for n > 0 
* 2n\ non * 
= (ora, da =o. 


On the other hand, m(L N A") < 1 for any L C A* and for all n. In particular, 


di) < 1 for all n > 0, and consequently the series „>o dS ) yn converges in the region 


{z€C| |z| < 1}. In fact 


1 
d) (z) = ——— for |z| < 1 (1.5.10) 


\/1 — 4pqz? 


since in this region, the (generalized) binomial formula gives 


1 i 
(=1)8 ( a Anh he" 
>0 


4/1 — 4pqz? ~ > 


and an elementary verification shows that 


(7) Za. r (1.5.11) 


Each word in D* N A” has a unique factorization into words in D. Thus 


d= X dmim dmy: 


k>0 mit +m =n 
For the corresponding generation functions, this gives 


1 


d® (t) = Tat" (1.5.12) 


Consequently we have, for |z| < 1, 


1 
= Ee as — 2 
g= =i oe) 1— /1—4pqz?. (1.5.13) feqi.4.13 
It follows that 


m™(D) =) d= lim $ dz” = lim 1- v1- 4pqz? = 1— y1- 4p. 


n>0 z€[0,1[ n>0 z€(0,1[ 
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It is easily seen that 1 — v1 — 4pq = 1 — |p — q|. Thus we get 


t(D) =1—|p—a| = 1- |r(a) — x(6)]. 


For z(a) a Ba ve 7(D) = 1. This gives another proof that D is a 
maximal R xa nlp BSN ote that 7(D) < 1 for any other Bernoulli distribution. 
By Formula , we haved (using again the binomial formula) 


1/2 
in = (ca / jere. 
n 
14 
Since (1/7) = (1/2n) -3 ) we obtain (using again Formula ( 
n n-1 


2 (2n—2 
dan = 2 (7 era. (1.5.14) 
n\n-1 


This gives as a byproduct the following formula 


2 (2n —2 
Cara(D 9 a2) = (m i: 


n— 1 


Let D' = D NaA* = DN A*b. The set D™* is exactly the set of well parenthesized 
strings when letter a stands for ‘(’ and letter b for ‘)’. The previous formulas allow us 
to compute easily its length distribution. Let D” = D N bA* = D N A*a and 


d} = n(D' N A”), d! = n(D" N A”), 


let d'(t), d” (t) be the series d'(t) = X n>0 dnt”, da (t) = D n>0 dnt”. We have D' = D" 
and d'(t) = d"(t). Since d(t) = d'(t) + a(t), we obtain, a(t) = La(t). Let di, = 
m(D* N A”) and dt) = ys dye”. Then, as for ( 


1 
1—d'(t)’ 


13 
Since d'(t) = 4d(t), we obtain from Ba 


d’“)(t) = 


d(z) 


2 
1+.f1—4pqz2 2pqz? ` 


1 2n 
d! (*) = non 
2N n+1 n pq 


14 
Therefore by Ea 


and also 


1 2 
Card(D™ N A?”) = = ( ") (1.5.15) 
n+1 
The numbers Yn = an? ”) for n > 0 are called the Catalan numbers. Their first values 
are given below. ° 
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milo 1 23 4 5 
m 1125 12.2 


Table 1.1: The Catalan numbers 


EXAMPLE 5.6 The set X = Un>oa"bA” is prefix, and therefore is a code over A = 
{a,b}. It is a maximal code. Let indeed 7m be a positive Bernoulli distribution, and set 
p= Tr(a). Then 

m(a"bA”) = p” (1 — p) 


hence 
m(X)= So P- p) = (1/1 - p))(1- p) = 1. 
n>0 
4.3 
We now give a statement which proves that the inequality of Corollary is actually 
tight. 


THEOREM 5.5 Given a sequence (tUn)n>1 of integers, there exists a code X over an 
alphabet A of k symbols such that un = Card(X N A") if and only if 


Stak <1. (1.5.16) 


n>1 
Moreover, the code X can be chosen to be prefix. 


Inequality (tay is called the Kraft inequality. 
Proof. It suffices to prove the existence of X. Observe first that by the inequal- 
ity, one has also J j<i<n uik? < 1 or equivalently, multiplying both sides by k”, 
ee ujk™—* < k” for all n > 1. Let us prove by induction on n > 1 that there exists 
a prefix code X, on an alphabet A of k symbols such that Card(X„ N AŻ) = u; for 
Ll<i<n. 

This is true for n = 1 since uy < k. Next, suppose that the property holds for 
n. The set of words of length n + 1 with a prefix in Xn is Uj<icn(Xn N A*)AM*E 
Consequently, the number of words of length n + 1 with a prefix in Xn is 


s= ) ukt tt, 


1<i<n 


Since s+ Un+1 < k”tl, we can choose a set Y of Un+1 words of length n + 1 without 
a prefix in Xn. In this way, the set Xn41 = Xn UY is a prefix code with length 
distribution (ui)i<i<n+1: C] 


6 Complete sets 


Any subset of a code is itself a code. Consequently, it is important to know the structure 
of maximal codes. Many of the results contained in this book are about maximal codes. 

The notion of complete sets introduced in this section is in some sense dual to that 
of a code. For instance, any set containing a complete set is itself complete. Even if 
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the duality is not perfectly balanced, it allows us to formulate maximality in terms of 
completeness, thus replacing an extremal property by a combinatorial one. 

Let M be a monoid and let P be a subset of M. An element m € M is completable 
in P if there exist u,v in M such that umv € P. It is equivalent to say that P meets 
the two-sided ideal MmM, 

MmMnP#6 


or, in other words, that 
mé€F(P)=M ‘PM. 

A word which is not completable in P is incompletable. The set of words completable 
in P is of course F(P); the set F(P) = M \ F(P) of incompletable words is a two-sided 
ideal of M which is disjoint from P. 

A subset P of M is dense in M if all elements of M are completable in P, thus if 
F(P) = M or, in an equivalent way, if P meets all (two-sided) ideals in M. Clearly, 
each superset of a dense set is dense. 


The use of the adjective dense is justified by the fact that dense subsets, of M are 
exactly the dense sets relative to some topology on M (see Exercise Bah 


EXAMPLE 6.1 Let A = {a}. The dense subsets of A* are the infinite subsets. 


EXAMPLE 6.2 In a group G, any nonempty subset is dense, since GmG = G for m in 
G. 


EXAMPLE 6.3 The Dyck code D over A = {a,b} is dense in A*. Indeed, if w € A*, 
then v = a2llewol! is easily seen to be in D*. Furthermore, no proper nonempty prefix 
of v is in D*. Thus v is in D, showing that w is completable in D. 


It is useful to have a special term for codes X such that the submonoid X~* is dense. 
A subset P of M is called complete in M if the submonoid generated by P is dense. 


Every dense set is also complete. Next a subset X of A* is complete if and only if 
F(X*) = A*. 


EXAMPLE 6.4 Any nonempty subset of a™ is complete, since it generates an infinite 
submonoid. 


The following theorem is of fundamental importance. 
THEOREM 6.1 Any maximal code is complete. 
The theorem is a direct consequence of the following proposition. 
PROPOSITION 6.2 Let X C At be a mazimal code. For any word w € A*, one has 
X*wA*nx* #0. 


Proof. The bas t i dg, clear if Card(A) = 1 of if w is the empty word. Otherwise, by 


Proposition 08.6, there is a word w’ € AT such that y = ww’ is unbordered. Set 
Y = X Uy. It suffices to prove that X*yA* N X* Æ Ø. Since Y is not a code, we 
have 21°+-2n = Yit Ym with n,m > 1, x,y; € Y and zı Æ yı. Since X is a 
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code, at least one of the x;,y; is equal to y. Consider the leftmost occurrence of y 
among the xi, yj. We may assume that it occurs among the x;, say at index k. Thus 


T1,---,Up-1 E X, £k = y. Let l be the least index such that x1 --- 2x, is refix of 
Yi Ye. Set w = z1- tku = yr-+- ye. Clearly w E€ X*yA* (see Figure (ate 


Ly Tk—1 Tk =y 


Yı Yp Ye 


Figure 1.7: 


prove that w € X* by showing that y1,..., Yọ E X. Let p be the least index such that 
£1 +++ Xp] İS a prefix of y1 --- Yp- Set £1- Lk—-1V = Y1: Yp, With v not empty because 
X is a code. Thus £u = vyp41--- Ye. One has y1,..., Yp E X by the minimality of k. 
Next, Yp+1,---,Ye-1 are proper factors of x, = y and therefore are also in X. Finally, 
ye # y since y is unbordered. So ye € X and w € X*. 7 


„5.6 
Example Btbclow shows that a finite code is not always contained in a finite maximal 
code. However, one has the following property. 


THEOREM 6.3 Every rational code is contained in a maximal rational code. 


The proof relies on the following result. 


PROPOSITION 6.4 Let X C At be a code. Let y € A* be an unbordered word such that 
A*yA* 0N X* = 9. Let 
U = A* — X* — A*yA*. (1.6.1) 
Y =X UyUy)* (1.6.2) 


Then the set 


is a complete code. 


Proof. Set V = A* — A*yA*. Then by assumption X* C V and U = V — X*. Let us 
first observe that the set Z = Vy is a prefix code. 
Assume indeed that vy < v'y for two words v and v’ in V. Since y is unbordered, vy 
must be a prefix of v’. But then v’ is in A*yA*, a contradiction. Thus Z is prefix. 
Now we show that Y is a code. Assume the contrary and consider a relation 


Y1Y2 `: Yn = YY Ym 


with y1,..., Yh, € Y and yı Æ y|. The set X being a code, one of these words must be 
in Y — X. Assume that one of y1,...,Yn is in Y — X, and let p be the smallest index 
such that yp € y(Uy)*. From y ¢ F(X*) it also follows that yp ¢ F(X*). Consequently 
one of yj,---;Ym is in y(Uy)*. Let q be the smallest index that y, € y(Uy)*. Then 


Yio Yp-1Y, YY YY EZ 
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whence y1 +++ Yp—1 = Y4: Vout since Z is prefix. The set X is a code, thus from yı 4 y4 
it follows that p = q = 1. Set 


Yı = YUY YUkY, Y1 = YUY: YUY, 


with u1,..., Uk, Uh,- -U E U. Assume y1 < yj. Since Z is prefix, the set Z* is right 
unitary. From U C V, it follows that each u;y, uży is in Z. Consequently 


U1 = Uj,- Uk = Uh 
Let t = u14: yuy. We have 
/ / 
Y2°°* Yn = tY2 ` Ym: 


The word y is a factor of t, and thus occurs also in y2--- yn This shows that one of 
Y2,-++;Yn, SAY Yr, is in y(Uy)*. Suppose r is chosen minimal. Then y2---y,-1y E€ Z 
and uj, 414 © Z are prefixes of the same word. With the set Z being prefix, we have 


/ 
Unit = Y2''' Yr-1- 


Thus uj,,, € X*, in contradiction with the hypothesis uj,,, € U. This shows that Y is 
a code. 
Finally, let us show that Y is complete. Let w € A* and set 


W = V1YV2Y ` : ' YUn—-1YUn 
with n > 1 and v; E€ A* — A*yA*. Then 
ywy € Y*. 
Indeed let v;,, Viz,- -Vip be those v;’s which are in X*. Then 


yoy = (yury s+ YViz—1Y Wi, (Ys, 41 Y °° YViz—1Y) 


Xo X Vi, (YY, +1Y +++ YUny) 
Each of the parenthesized words is in Y. Thus the whole word is in Y*. m 


.5.1t A 
Proof of Theorem bt Since X is rational, the set U defined in Equation (Eos also 


rational. E 


EXAMPLE 6.5 Let A = {a,b} and X = {a,ab}. The word y = bba is unbordered and 
is incompletable i X74, deterministic automaton recognizing U = A* \ X* \ A*yA* 
is given in Figure fe 

Accordingly, we have after some rewriting the rational expression 


U = bt + X*abb* + bX*ab*. 


Consider a Bernoulli distribution given by (a) = p, 7(b) = q. Then a gasy computa- 
tion shows that 7(U) = 1/pq and thus 7(Y) = 1 for Y defined by ( , which implies 
that Y is maximal. ° 
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Figure 1.8: An automaton recognizing U 


EXAMPLE 6.6 Let A = {a,b} and X = {bb, bbab, babb}. The word y = aba is incom- 
pletable in X*. However, X Uy is not a code, since 


(bb) (aba)(babb) = (bbab)(aba) (bb) . 


iz 5.2 i ! ; 
This example shows that Proposition Ba is false without the assumption that y is 
unbordered. 


ctioni.1 
EXAMPLE 6.7 We are able now to verify one of the claims made in Section came 
that there do exist finite codes which are not contained in a maximal finite code. 
Let X = {a°, ba”, ab, b}. It is a code over A = {a,b}. Any maximal code containing 
X is infinite. Indeed, let Y be a maximal code over A containing X, and assume Y 
finite. Set m = max{|y| | y € Y} and let 


u= pr gitdmpm ; 
The maximality of Y implies its completeness. Thus u is a factor of a word in Y*. 


Neither b” nor a4t5™ can be proper factors of a word in Y. Thus there exist y, y’ € YU1 
and integers p,q,r > 0 such that 


u = bPyaty’b" 


5 
with a? € Y* (see Fig E9. The word až is the only word in Y which does not contain 
b; thus q is a multiple of 5; this implies that |yļa + |y’|a = 4 mod 5. 


br at+5m b™ 


y y’ 


Figure 1.9: The factorization of b™a‘t+°™b™ in words in Y 


Let y = basti and y’ = ato with 0 < i,j < 4. We have i+ j = 4 mod 5 whence 
i+ j= 4. We will show that any choice of i,j leads to the conclusion that Y is not a 


code. This yields the contradiction. 
If i = 0, j = 4, then k > 1 and we have ba? - a®#+4pk = b . að(Œ+1) . ab. b¥-1, 
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If i= 1, j = 3, then b’a®+! . b = b” . a5 - ab. 

If i = 2, j = 2, then b+ a2*5*p* = ba? - a” - bF, 

If i = 3, j = 1, then h > 1 and b*a5s+3 . b = b’! . ba? - a8 - ab. 

Finally, if i = 4, j = 0, then b’a5*+4 . ab = bP. g5le+) . p, ° 

This example is a particular case of a general construction (see Exercise bay? 

The converse of Theorem is false (see Example . However, it is true under an 
additional assumption that relies on the following definition. 

A subset P of a monoid M which is not dense is called thin. If P is thin, there is at 
least one element m in M which is incompletable in P, i.e., such that MmM N P= f, 
or equivalently F(P) # M. ee 

The use of the adjective thin is justified by results like Proposition JELOS A 


PROPOSITION 6.5 Let M be a monoid and P,Q, R C M. Then the set PUQ is thin if 
and only if P and Q are thin. If R is dense and P is thin, then R— P is dense. 


Proof. If P and Q are thin, then there exist m,n € M such that 
MmMnP=9, MnMnQ=9. 


Then mn is incompletable in PU Q and therefore PU Q is thin. Conversely if PUQ is 
thin, there exists m € M which is incompletable in P U Q and therefore incompletable 
in P and also in Q. Hence P and Q are thin. If R is dense in M and P is thin, then 
R-— P cannot be thin since otherwise R = (R\ P) UP would also be thin by the above 
statement. m 


Thin subsets of a free monoid have additional properties. In particular, any finite 
subset of A* is clearly thin. Furthermore, if X,Y are thin subsets of A* then the set 
XY is thin. In fact, if u g F(X), v g F(Y), then w ¢ F(XY). 


.5.3 
EXAMPLE 6.8 The Dyck code D over A = {a,b} is aw (See Example Bayt isa 


maximal code since it is a group code (see E aple . For each x € D, the code 
D — zx remains dense, in view of Proposition 6.9, and thus remains complete. But of 
course D — æ is no more a maximal code. This example shows that the converse of 
Theorem 6.1} doés not hold in general. 


5. 
Theorem admits a converse in the case of codes which are both thin and complete. 
Before going on to prove this, we give some useful properties of these sets. 


PROPOSITION 6.6 Let X C A* be a thin and complete set. Let w be a word incom- 
pletable in X. Then 


A= |) gg ao ee. (1.6.3) 
dED,gEG 


where D and G are the sets of suffixes (resp. prefixes) of w. 
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Note that the set D x G is finite. 
Proof. Let z € A*. Since X* is dense, the word wzw is completable in X*, thus for 
some u,v € A* 

uwzwy € X*. 


Now w is not a factor of a word in X. Thus there exist two factorizations 
w = gid = gd 


such that 
ugi,dzg,djv € X*. 


This shows that z € d~!X*g7!. n 


PROPOSITION 6.7 Let X be a thin and complete subset of A*. For any positive Ber- 
noulli distribution n on A*, we have 


m(X)>1. 


3 
Proof. We have 7(A*) = oo. In view of Eq. (53H there exists a pair (d,g) € Dx G 
such that (d~1X*g-!) = œ. Now 


d(d~'X*g71)g C X*. 


This implies 
m(d)n(d-* X*g~*)n(g) < W(X*). 


The positivity of m shows that 7(dg) 4 0. Thus 7(X*) = co. Now 


m(X*) < So m(X") < Y(X). 
n>0 n>0 


Assuming 7(X) < 1, we get 1(X*) < oo. Thus 7(X) > 1. m 


Note the following property showing, as already claimed before, that a thin set has 
only few words. 


PROPOSITION 6.8 Let X C A* be a thin set. For any positive Bernoulli distribution 
on A*, we have 
™(X) <œ. 


Proof. Let w be a word which is not a factor of a word in X: w ¢ F(X). Set n = |w]. 
We have n > 1. For 0 <i < n — 1, consider 


X,={x € X | |z| =imodn}. 
It suffices to show that 7(X;) is finite for i = 0, ... n — 1. Now 
Xic A‘(A” = w)* ; 
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Since A” — w is a code, we have 


m[(A" — w)*] = X aA" — w))* = S01 - aw). 


k>0 k>0 


The positivity of m implies 7(w) > 0 and consequently 


Thus 7(X;) < 1/7(w). n 
We are now ready to prove 


THEOREM 6.9 Any thin and complete code is maximal. 


Proof. Let X þe a in, complete code and let m ea positive Bernoulli distribution. By 


Propositio | W(X) > 1, gnd by Theorem | we have 7(X) < 1. Thus 7(X) =1. 
But then Proposition bat stiows that X is maximal. n 


.5.1 5.7 
Theorems Bara = can be grouped together to give 


THEOREM 6.10 Let X be a code over A. Then X is complete if and only if X is dense 
or maximal. 


Proof. Assume X is complete. If X is not dense, then it is thin, and consequently X is 
maximal by the previous theore -,Gonversely, a dense set is complete, and a maximal 
code is complete by Theorem b.t: _ n 


Before giving other consequences of these statements, let us present a first application 
of the combinatorial characterization of maximality. 


PROPOSITION 6.11 Let X C A* be a finite maximal code. For any nonempty subset B 
of A, the code X N B* is a maximal code over B. In particular, for each letter a € A, 
there is an integer n such that a” € X. 


Proof. 

The second claim results from the first one by taking B = {a}. Let n = max{|z| | 
x € X} be the maximal length of words in X, and let 0 # B C A. To sho that 
Y = X N B* is a maximal code over B, it suffices to show, in view of Theorem at 
Y is complete (in B*). Let w € B* and b € B. Consider the word 


w! = brtlbr t! . 
The completeness of X gives words u,v € A* such that 
uwv = T12: Tk 


for some z1, £2,..., £k E X. But by the definition of n, there exist two integers 7,7 
(1<i< j< k) such that 
UjLj+1°°* Tj = b" wb? 
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for some r,s € {1,...,n} (see Fig. ETO). But then zi, £i+1;.-- £j E X N B* =Y. This 
shows that w is completable in Y*. n 


Let X C A* be a finite maximal code, and let a € A be a letter. The (unique) integer 
n such that a” € X is called the order of a relative to X. 


a oN an a 


u b+! b+! 


Figure 1.10: The factorization of ub’t!wb"t!v 


THEOREM 6.12 Let X be a thin code. The following conditions are equivalent: 
(i) X is a maximal code. 
(ii) There exists a positive Bernoulli distribution n with n(X) =1. 
(iii) For any positive Bernoulli distribution 7, we have 7(X) = 1. 
(iv) X is complete. 


4.2 

Proof. ( i s Theorem bit T a (iii) is a consequence of Theore an 
a ili) = (ii) is not very ma and (ii) => (i) is Proposition Et n 
Theorem gives a surprisingly simple method to test whether a thin code is 


maximal. It suffices to take any positive Bernoulli distribution and to check whether 
the measure of the code equals 1. 


EXAMPLE 6.9 The Dyck code D over A = {a,b} is maximal an gomplete, but satisfies 
m(D) = 1 only for one Bernoulli distribution (see Example 6.3). Thus the conditions 
(i) + (ii) + (iv) do not imply (iii) for dense codes. 


EXAMPLE 6.10 The prefix code X = Un>o9a"bA” over A = {a,b} is dense since for all 
w € A*, al’lbw € X. It satisfies (iii), as we have seen in Example Pa thus X satisfies 
the four conditions of the theorem without being thin. 


THEOREM 6.13 Let X be a thin subset of At, and let m be a positive Bernoulli distri- 
bution. Any two among the three following conditions imply the third 
(i) X is a code, 
ü) r(X)=1, 
(iii) X is complete. 


Proof. (i) + N = aii). The conditio T(x } = 1 implies that X is a maximal code, 
by Proposition us by, T eorem Et complete. 

(i) + (iii) > (ii) Theorem and condition (i (i) imply that 7(X) < 1. Now X is thin 
and complete; in view of Proposition b: i we have 7(X) > 1. 

(ii) + (iii) > (i) Let n > 1 be an integer. First, we verify that X” is thin and 
complete. To see completeness, let u € A*, and let v,w € A* be such that vuw € X*. 
Then vuw € X* for some k > 0. Thus (vuw)” € (X")* c (X")*. This shows that u 
is completable in (X")*. Further, since X is thin and because the product of two thin 
sets is again thin, the set X” is thin. 
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5.5 
Thus, X” is thin and complete. Consequently, 7(X") > 1 by Proposition Beton the 
other hand, we have 1(X") < m(X)” and thus 7(X”) < 1. Consequently 7(X”) = 1. 
Thus for all n > 1 


w(x”) =X)". 
-4.1 
Proposition Bal stiows that X is a code. n 


Thin codes constitute a very importa class, of codes. They will be characterized 
by some finiteness condition in Chapter -We anticipate these results by proving a 
particular case which shows that the class of thin codes is quite a large one. 


PROPOSITION 6.14 Any recognizable code is thin. 


Proof. Let X C A* be a recognizable code, and let A = (Q,i,T) be a deterministic 
complete automaton recognizing X. Associate to a word w, the number 


p(w) = Card(Q- w) = Card{q-w|qe€ Q}. 


We have p(w) < Card(Q) and for all u,v, 


p(uwr) < p(w). 


Let J be the set of words w in A* with minimal p(w). The inequality above shows that 
J is a two-sided ideal of A*. 

Let w € J, and let P = Q - w. Then P -w = P. Indeed P -w C Q -w = P, and on 
the other hand, P -w = Q- w?. Thus Card(P - w) = p(w?). Since p(w) is minimal, 
p(w?) = p(w), whence the equality. This shows that the mapping p +> p: w from P 
onto P is a bijection. It follows that there is some integer n with the property with the 
mapping p |> p- w” is the identity mapping on P. 

Since P = Q- w, we have 


q-w=q: wt! forallgeq. 


To show that X is thin, it suffices to show that X does not meet the two-sided ideal 
J. Assume that JN X #0 and let z € X N J. Then 


icr=tET. 


Next x € J and, by the previous discussion, there is some integer n > 1 such that 
i-a"t! = t. This implies that x”t!t € X. But this is impossible, since X is a code. 


2 
The converse of Proposition Bris false. 


EXAMPLE 6.11 The code X = {a"b" | n > 1} is thin (e.g., ba is not a factor of X), 
but X is not recognizable. 
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.12 
EXAMPLE 6.12 In one interesting case, the converse of Proposition Bld holds: Any 
thin group code is recognizable. Indeed let X C A* be a group code. Let y: A* => G 
be a surjective morphism onto a group G, and let H be a subgroup of G such that 
X* = y!(H). By assumption, X is thin. Let m be a word that is incompletable in 
X. We show that H has finite index in G, and more precisely that 


G = Up<mH[6(p)|* , 


(where p runs over the prefixes of m). Indeed let g € G and w € y7!(g). Let u € A* 
be such that y(u) is the group inverse of gy(m). Then y(wmu) = gy(m)y(u) = 1, 
whence wmu € X*. Now m is incompletable in X. Thus m is not factor of a word in 
X and consequently there is a factorization m = pq such that wp,qu E€ X*. But then 
h = y(wp) € H. Since h = gy(p), we have g € Hy(p)~!. This proves the formula. 
The formula shows that there are finitely many right cosets of H in G. Thus the 
permutation group on the right cosets, say K, is also finite. Let a OF K be 
the canonical morphism defined by Hra(g) = Hrg (see Section i Then, setting 
N = {o € K | Ho = H}, we have H=a7!(N) = a~t (a(H)). Thus X* = oy! W(X"), 
where Y = a-y. Since K is finite fhjs_shows that X* is recognizable. Consequently, 


X is also recognizable (Exercise eq ° 
REMARK 6.15 . We have used in the preceding paragraphs arguments which rely 
basically on two tech igues: measures on the one hand which allowed us to prove 
especially Theorem Bge combinatorial arguments on words on the other (as 
in the proof of Theorem : 

It is interesting to note that some of the proofs can be done by using just one of 
the two techniques. A careful analysis shows that all preceding statements with the 
exception of those involving maximality can be established by using o ly arguments on 
measures. As an example, the implication (ii) > (iv) in Theorem can be proved 
as follows without usi the maximality of X. If X is not complete, then X* is thin. 
Thus, by Proposition 6.8, 7(X*) < co which implies 7(X) < 1 by Proposition b.I, 

Conversely, there exist, for some of the results given ere, combinatorial proofs which 
do not rely on measures. This is the case for Theorem 6.9, where the proof given relies 
heavily arguments ke measures. Another proof of this result will be given in 
Chapter Ki ilary 1 . This proof is based on the fact that if X C A* is a thin 
complete code, then all words w € A* satisfy 


(X*wX*)t A X* #0. 


5.7 
This implies Theorem Bd because according to this formula, X U w is not a code for 
w ¢ X and thus X is a maximal code. 


We end this section by showing how to test (by means of an automaton) whether a 
code is complete. 


PROPOSITION 6.16 Let X C A™, and let A = (Q,1,1) be a trim automaton recognizing 
X*. Then X is complete if and only if the transition monoid of A does not contain the 


null relation. 
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Proof. If X is complete, then there exist, for each w € A*, two words u,v € A* such 
that uwv € X*. Then there exists a path 1 — p + q > 1. This implies that (p,q) 
is in y(w) and consequently y4(w) is not null. 

Conversely, if y4(A*) does not contain the null relation, then for each w € A*, there 
exists at least one path p —> q. Since A is trim, there exist two paths 1 — p and 
q—> 1. Then uwv € X*. Thus X is complete. n 


7 Composition 


We now introduce a partial binary operation on codes called composition. This oper- 
ation associates to two codes Y and Z satisfying a certain compatibility condition a 
third code denoted by Y o Z. 

There is a twofold interest in this operation. First, it gives a useful method for 
constructing more complicated codes from simpler ones. Thus we will see that the 
composition of a prefix and a suffix code can result in a code that is neither prefix nor 
suffix. 

Second, and this constitutes the main interest for composition, the converse notion 
of decomposition allows us to study the structure of codes. If a code X decomposes 
into two codes Y and Z, then these codes are generally simpler. 

Let Z C A* and Y C B* be two codes with B = alph(Y). Then the codes Y and Z 
are composable if there is a bijection from B onto Z. If 8 is such a bijection, then Y 
and Z are called composable through (3. T ep £ defines a morphism B* — A* which is 
injective since Z is a code (Proposition . The set 


X =6(Y) c Z* c A“ (1.7.1) 
is obtained by composition of Y and Z (by means of 8). We denote it by 
X=Y oZ or X=Yo0Z, 


when the context permits it. Since (@ is injective, X and Y are related by bijection, and 

in particular Card X = Card Y. The words in X are obtained just by replacing, in th <a 
words o „each letter b by the word (b) € Z. The injectivity of 8, the Corollary pe 
and é ; a show the following result 


PROPOSITION 7.1 If Y and Z are two composable codes, then X = Y o Z is a code. 


EXAMPLE 7.1 Let A = {a,b}, B = {c,d,e} and 
Z = {a, ba, bb} C A*, Y = {cc, d, dc, e, ec} C B*. 


The code Z is prefix, and Y is suffix. Further Card B = Card Z. Thus Y and Z are 
composable, in particular by means of the morphism 8 : B* — A* defined by 


b(c)=a,  p(d)=ba, (e) = bb. 
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Then X =Y o Z = {aa, ba, baa, bb, bba}. The code X is neither prefix nor suffix. Now 
define 3’ : B* — A* by 
B(c)=ba, B(d)=a, Be) = bb. 


Then X’ = Y og Z = {baba, a, aba, bb, bbba}. This example shows that the composed 
code Y og Z depends essentially on the mapping £. 


Let X C A* be a code, with A = alph(X). Since A is a code over itself and X = (X), 
with .: A* — A* the identity mapping, we have 


X=X0,A. 
Let 3: B* — A* be a coding morphism for X. Then X = 6(B) and thus 
X=BosX. 


These examples show that every code is obtained in at least two ways as a composition 
of codes. 

The two expressions X = X o A and X = Bo X are exactly the particular cases 
obtained by replacing one of the two codes by the alphabet in the expression 


X=Y 0Z. 


Indeed, if Y = B, then Z = (B) = X; if now Z = A, then B can be identified with 
A, and Y can be identified with X. 
Notice also the formula 


X=Yo0o Z> X"=Y"o9Z n>2. 
Indeed, Y” is a code (Corollary rand 
Y”oZ=ß6Y")=X”. 
PROPOSITION 7.2 Let X C C*, Y C B*, and Z C A* be three codes, and assume that 
X and Y are composable through y and that Y and Z are composable through B. Then 
(X 0, Y) 0g Z = X ofoy (Y 0g Z). 


Proof. We may suppose that C = alph(X), B = alph(Y). By hypothesis the injective 
morphisms 
y:C* > B*, B : B* = A* 
satisfy 
(0) =¥,  B(B) =z. 
Let 6 : D* — C* be a coding morphism for X; thus 6(D) = X. Then 


D* ô, o* 7, BY É, A, 


and aD = a O = Xopy (¥ 09 Z), and also aD = = 
70(D) og P(B) = (X 0, Y)o Z. n 


Some of the properties of codes are preserved under composition. 
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PROPOSITION 7.3 Let Y and Z be composable codes, and let X =Y o Z. 
1. IfY and Z are prefix (suffix) codes, then X is a prefix (suffix) code. 
2. IfY and Z are complete, then X is complete. 
3. IfY and Z are thin, then X is thin. 


6.4 
The proof of (3) uses Lemma (Aa Wwhich cannot be established before Chapter [VI 
where more powerful tools will be available. 


LEMMA 7.4 Let Z be a thin complete code over A. For each word u € Z* there exists 
a word w E€ Z*uZ* having the following property. If mwn E€ Z*, then there exists a 
factorization w = sut with ms,tn € Z*. 


Proof of Proposition ER Y c B*, Z C A*, and let 8 : B* — A* be an injective 
morphism with 6(B) = Z. Thus X = (Y) =Y og Z. 

1. Assume Y and Z are prefix codes. Consider z, xu € X with u € A*. Since 
X C Z*, we have xz xzu € Z* and since Z* is right unitary, this implies u € Z* Let 
y = B-'(x),v = B-l(u) € B*. Then y,yv € Y and Y is prefix; thus v = 1 and 
consequently u = 1. This shows that X is prefix. The case of suffix codes is handled 
in the same way. 

2. Let w € A*. The code Z is complete, thus uwv € Z* for some u,v € A*. Let 
h = B-'(uwv) € B*. There exist, by the completeness of Y, two words u,v € B* with 
uho € Y*. But then G(tu)uwvG(v) € X*. This proves the completeness of X. 

3. If Z is not complete, then F(X) c F(Z*) Æ A* and X is thin. Assume now that 
Z is complete. The code Y is thin. Consequently F(Y) # B“ bet ue BY — F(Y), and 
u = G(u). Let w be the word associated to u in Lemma J7.4. Then w ¢ F(X). Indeed, 
assuming the contrary, there exist words m,n € A* such that 


r=munex C Z*. 
.6.4 
In view of Lemma frat 
x=msutn, with ms,tn € Z* = B(B*). 


Setting p = 87! (ms), q = B-(tn), we have pūq € Y. Thus ù € F(Y), contrary to the 
assumption. This shows that w is not in X, and thus X is thin. n 


We now consider the second aspect of the composition operation, namely the decom- 
position of a code into simpler ones. For this, it is convenient to extend the notation 
alph in the following way: let Z C A* be a code, and X C A*. Then 


alphz(X) = {z € Z | 3Ju,v € Z* : uzv E X}. 


In other words, alphz(X) is the set of words in Z which appear at least once in a 
factorization of a word in X as a product of words in Z. Of course, alph , = alph. The 
following proposition describes the condition for the existence of a decomposition. 


PROPOSITION 7.5 Let X,Z C A* be codes. There exists a code Y such that X = Y o Z 


if and only if 
XCZ* and alphg(xX)=Z. (1.7.2) 
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The second condition in (EA means that all words in Z appear in at least one 
factorization of a word in X as product of words in Z. 
Proof. Let X = Y og Z, where 3 : B* — A* is an injective morphism, Y C B* and 
B = alph(Y). Then X = (Y) C (B*) = Z* and further 6(B) = alphgg)(G(Y)), 
that is, Z = alphz(X). 

Conversely, let 8 : B* — A* be a coding morphi for,Z , and set Y = @-1(X). Then 
X c (B*) = Z* and (Y) = X. By Corollary Pat. a code. Next alph(Y) = B 
since Z = alphz(X). Thus Y and Z are composable and X = Y og Z. m 


We have already seen that there are two distinguished decompositions of a code X C 
A* as X = YoZ, namely X = Box and X = XoA. They are obtained by taking Z = X 
and Z = A in Proposition and assuming A = alph(X)]. These decompositions are 
not interesting. We will call indecomposable a code which has no other decompositions. 
Formally, a code X C A* with A = alph(X) is called indecomposable if X = Y o Z and 
B = alph(Y) imply Y = B or Z = A. If X is decomposable, and if Z is a code such 
that X = Yo Z, and Z £ X, A, then we say that X decomposes over Z. 


6.1 
EXAMPLE iF i (continued) The code X decomposes over Z. On the contrary, the code 
Z = {a,ba,bb} is indecomposable. Indeed, let T be a code such that Z c T*, and 
suppose T # A. Necessarily, a € T. ayi b Z T. But then ba, bb € T, whence Z C T. 


Now Z is a maximal code (Example thus Z = T. 
PROPOSITION 7.6 For any finite code X, there exist indecomposable codes Z1,...,Zn 
such that 


X= Z10:::0Z,. 
To prove this proposition, we introduce a notation. Let X be a finite code, and let 
&(X) = X (le|- 1) = SO |z|- Card(X). 
vex rex 


For each x € X, we have |x| > 1. Thus (X) > 0, and moreover ¢(X) = 0 if and only 
if X is a subset of the alphabet. 


PROPOSITION 7.7 If X,Z C A* and Y C B* are finite codes such that X = Y o Z, 
then €(X) > UY) + &(Z). 


Proof. Let 8 : B* — A* be the injective morphism such that X = Y og Z. From 
Card(X) = Card(Y) it follows that 


&(X)—&Y) = SF lel- SO lul = S206) - lyh). 


rex yey yeY 


Now |G(y)| = Mees |F(®)| + lulo- Thus 


UX) = CY) = E (El = Iyls)) = EAEOI- V ll) 


yEY bEB yEY bEB 
= OL- 1)- (X lye). 
beB yeY 
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By assumption B = alph(Y), whence -ey lylo > 1 for all b in B. Further |6(b)| > 1 
for b € B by the injectivity of 6. Thus 


UX) — Y) > FOL- 1) = Fle- 1) =Z). : 


bE B zEZ 


-6.6 
Proof of Proposition Fa ihe proof is by induction on (X). If (X) = 0, then X is 
composed of letters, and thus is indecomposable. If (X) > 0 and X is decomposable, 
then X = Y o Z for some codes Y,Z. Further Y and re not formed of letters 
only, and thus (Y) > 0, €(Z) > 0. By Proposition Fars have (Y) < €(X) and 
&(Z) < &(X). Thus Y and Z are compositions of indecomposable codes. Thus X also 
is such a composition. n 


-6.6 
Proposition astows the existence of a decomposition of codes. This decomposition 
needs not be unique. This is shown in 


EXAMPLE 7.2 Consider the codes 
X = {aa, ba, baa, bb, bba}, Y = {cc,d,dc,e,ec}, Z = {a, ba, bb} 
of Example AES we have seen, X = Y o Z. There is also a decomposition 
X=Y' o% Z 


with 
Y’ = {cc,d,cd,e,ce}, Z’ = {aa,b, ba} 


and y : B* — A* defined by 


The code Z is indecomposable, the code Z’ is obtained from Z by interchanging a and 
b, and by taking then the reverse. These operations do not change indecomposability. 
e 


EXAMPLE 7.3 This example shows that in decompositions of a code in indecomposable 
codes, even the number of components need not be unique. For X = {ab}, we have 


X = {cd} o {a?, ab} = {cd} o {u?, v} o {a, ab} 


and also 


X = {cd} o {a,b}. 


This gives two decompositions of length 3 and 2, respectively. ° 


6.2 
The code X in Example iz is neither prefix nor suffix, but is composed of such codes. 
We may ask whether any (finite) code can be obtained by composition of prefix and 
suffix codes. This is not the case, as shown in 
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EXAMPLE 7.4 The code X = {b, ba, a?b, abat} does not decompose over a prefix or a 
suffix code. 

Assume the contrary. Then X C Z* for some prefix (or suffix) code Z 4 A. Thus Z* 
is right unitary (resp. left unitary). From b,ba € Z*, it follows that a € Z*, whence 
A= {a,b} C Z* and A= Z. Assuming Z* left unitary, b, a?b € Z* implies a? € Z*. It 
follows that a°b € Z*, whence a € Z* and finally a € Z*. Thus again Z = A. ° 


We now give a list of pr rhies of codes which are inherited by_t S factors of a 
decomposition. Proposition E is in some sense dual to Proposition ; 


PROPOSITION 7.8 Let X,Y,Z be codes with X =Y o Z 
1. If X is prefix (suffix), then Y is prefix (suffiz). 
2. If X is maximal, then Y and Z are maximal. 


3. If X is complete, then Z is complete. 
4. If X is thin, then Z is thin. 


Proof. We assume that X, Z C A*, Y C B*, 6 : B* — A* an injective morphism with 
B(B) = Z, BY) = X. 

1. Let y, yu € Y. Then 8(y), B(y)G(u) € X, and since X is prefix, B(u) = 1. Now 8 
is injective, whence u = 1. 

2. If Y is not maximal, let Y’ = Y Uy be a code for some y ¢ Y. Then 8(Y') = 
B(Y) U By) is a code which is distinct from X by the injectivity of 3. Thus X is not 
maximal. 

Assume now that Z is not maximal. Set Z’ = Z U z for some z ¢ Z such that Z’ is 
a code. Extend B to B' = B Ub, (è Z B) and define 8 over B’*x by 3(b) = z. Then 8 
is injective by Proposition frifbecause Z' is a code. Further Y’ = Y U b is a code, and 
consequently 6(Y’) = X U z is a code, showing that X is not maximal. 

3. is clear from X* C Z*. 

4. Any word in Z is a factor of a word in X. Thus F(Z) c F(X). By assumption, 
F(X) # A*. Thus F(Z) # A* and Z is thin. m 


PROPOSITION 7.9 Let X,Y,Z be three codes such that X =Y o Z. Then 


Xis thin and complete = Y and Z are thin and complete. 


-6.3 
Proof. By Proposition fra The code X is thin and complete, oyiged Y and Z are. 
Assume conversely that X is thin and c mplete. Proposition Pa aos that Z À ae 


and complete. In view of Theorem , X is axial code. By Proposition [7.8, 

is maximal, and thus Y is complete (Theorem Ef gems to show that Y is thin. 
With the notations of the proof of Proposition [7.8, consider a word u ¢ F(X). Since 
Z* is dense, sut € Z* for some words s,t € A*. Thus sut = B(w) for some w € B*. But 
now w is not completable in Y, since otherwise hwk € Y for some h,k € B*, giving 
B(h)sutB(k) € X, whence u € F(X). Thus Y is thin. m 


6.9 
By Proposition [rok tor thin codes Y, Z, the code Y o Z is maximal if and only if Y 
and Z are maximal. We have no example showing that this becomes false without the 
assumption that Y and Z are thin. 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


86 I. Codes 


PROPOSITION 7.10 Let X be a maximal code over A. For any code Z C A*, the code 
X decomposes over Z if and only if X* C Z*. In particular, X is indecomposable if 
and only if X* is a maximal free submonoid of A*. 


Proof. If X decomposes over Z, then X* C Z*. Conversely, if X* c Z*, let. Z 
alphz(X). Then X c Z*, and of course, Z = alphz(X). By Proposition ; 
decomposes over Z. In view of Proposition rat the code Z is maximal. By Z C Z, we 
have Z = Z. n 


nll 


6. 


EXAMPLE 7.5 Let A be an alphabet. We show that the uniform code A” decomposes 
over Z if and only if Z = A™ and m divides n. In particular, A” is indecomposable for 
n prime and for n = 1. 

Indeed, let A” = X = Y og Z, where Y C B* and 8 : B* € A*. The code X is 
maximal and bifix, and thus Y also is maximal and bifix and Z is maximal. Let y € Y 
be a word of maximal length, and set y = ub with b € B. Then Y U uB is prefix. 
Let indeed y’ = ub’, b' € B. Any proper prefix of y’ is also a proper prefix of y, and 
therefore is not in Y U uB. Next if y’ is a prefix of some y” in Y U uB, then by the 
maximality of the length of y, we have |y’| = |y”| and y/ = y”. Thus Y U uB is a code. 
Hence Y U uB = Y, because Y is maximal. It follows that G(uB) = B(u)Z c X. Now 
X is a uniform code, thus all words in Z have the same length, say m. Since Z is 
maximal, Z = A™. It follows that n = m - |y]. e 


8 Prefix graph of a code 


The prefix graph is a graph used to give an efficient test whether a set X is a code. 
The graph can also answer some other questions on the set X, by applying standard 
techniques for graph traversal. This will be detailed in later chapters. 

Let X be a finite set of words over some alphabet A. We define a graph Gx for X, 
called the prefix graph of X as follows. The vertices of Gx are the nonempty prefixes 


of words in X, and there is an edge from s to t if and only if one of the two. followin dii 
situations occurs: either st € X or sx = t for some x € X, see Figure i 


Figure 1.11: The two types of edges in a prefix graph. 


Edges of the first type are called crossing, those of the second type extending. A 
crossing edge (s,t) is labeled with the word t, an extending edge (s,t) with sx = t is 
labeled with z. As usual, the label of a path is the product of the label of its edges. In 
the case where sx = t and x,t are in X, then (s,t) is an extending edge labeled with 
x, and (s,2) is a crossing edge, also labeled with zx. 

A vertex s is intended to represent a prefix that has been constructed in the process 
of trying to build a double factorization, say ys = z, for y,z € X*. A crossing edge 
(s,t), with st = x € X, gives the factorization yx = zt, and the prefix t swapped to the 
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other side of the equation, whereas an extending edge (s,t) with sx = t merely Mee subiera 


the factorization by yt = zx, extending the current prefix from s to t. See Figure i 


= > 


Figure 1.12: The two ways of continuing a double factorization ys = z. On the left, it 
is extended to yx = zt, and on the right to yt = za. 


EXAMPLE 8.1 Let X = {a, bb, abbba, babab} over the alphabet A = {a,b}. The non- 


fig-doublefact 


empty prefixes, in addition to the words in X, are the words b, ab, ba, abb, bab. abl Dee ieee 
and baba, so the graph has 11 vertices. The prefix graph G y is given in Figure 


2. 
pp abo ae ty 
bab b 
. bab —~q— baba 
nn “ab 
Gs, t 
a abbb | ab 


babab 


Figure 1.13: The prefix graph Gx for the set X = {a, bb, abbba, babab}. A crossing edge 
is drawn dashed, an extending edge is drawn filled. The label of a crossing edge is the 
name of its endpoint. The label of an extending edge (s,t) is the word x in X for which 
sx =t. 


We will prove that the set X is a code if and only of there is no path in the prefix 
graph Gx from a vertex in X to a vertex in X. In our example, there is a path from 
a to itself, or to abbba, so the set is not a code. 

We start with a lemma describing paths in the prefix graph Gy. First, we need a 
definition. Two factorizations (%1,...,@,) and (y1,..., Ym) of a word are disjoint if 
zi eti AY yj for l<i<n,1<j<m. We say simply that 


Ty En = Y1: Ym 


is a disjoint double factorization when the two factorizations (£1,...,£n) and (y1, 
.-, Ym) of the same word are disjoint. 
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LEMMA 8.1 There is a path of length n > 1 from s tot in the prefix graph of X if and 
only if there exist £1,..., £k, Y1,---,Ye in X such that 


are disjoint factorizations with k + L = n, and moreover s is a prefix of xı (resp. a 
prefiz of t if k = 0). The label of the path is yı --- yet in the first case and y,--+ ye in 
the second case. The first (second) case occurs if and only if the path contains an odd 
(even) number of crossing edges. 


EXAMPLE 8.2 Consider as an example the path 


ba bab 


abb 2% ba 2 bab @ ab @ abbd 


in the previous graph. It is represented in the following picture. 


babab 


ab bbalba ba bib bl 


This path has length 4, the first 3 edges are crossing edges, the last one is an extending 
edge. It corresponds to the disjoint factorizations abb|babab|abbb = abbba|babab|bb. Here 
L= 1, k = 3, and the product of labels is babababbb. The path 


a. abb & ba 2 bab © ab © abbbb & a 


has two more edges. 


alb bib aba bla b bba 


a b bbaba babib bla 


It corresponds to the disjoint factorizations a|bb|babab|abbba = abbba|babab|bb|a which 
shows that X is not a code. 


Proof of the lemma. Assume first that there is a path of length n > 1 from s to t. If 
n = 1, then either st = x, or sx = t with x € X. Thus there is a double factorization 
of the desired form for n = 1. 

Assume now n > 1, and that there is edge from t to u. By induction, sy1--- yet = 
Lys LE OY SY +++ Ye = T1: Xpt, and either tu = x € X or tx = u for some x € X,u ¢ 
X. So there are four cases to check. 

If syy--- yet = 21 --- £k and tu = x E€ X, then sy --- yx = 21 ---xpu, and these 
factorizations are again disjoint because u is a proper suffix of x. 

If sy, --- yet = 21 --- £k and tr = u for some x € X, then sy, --- yeu = z1: px and 
again the factorizations are disjoint because u is a proper suffix of t, so of xp. 

If syy---ye = £1: apt and tu = x E€ X, then sy,---ygu = z1- zg£ and the 
factorizations are disjoint because u is a proper suffix of x. Moreover, if k = 0 then s 
is a prefix of x because s is a prefix of t and t is a prefix of x 

Finally, if sy,---ye = %1-°--xpt and tx = u for some x € X, then sy,--- yer = 
a1---a,u. The factorizations are again disjoint. If k = 0, then s is a prefix of t and t 
is a prefix of u, so the word s is a prefix of u. 
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Conversely, assume that there are a double factorization sy, --- yet = £1- £k or a 
double factorization sy,--- yg = £1: zkt, with k+ £L =n. If n = 1, then k= 1,2 = 0 
in the first case, and k = 0, = 1 in the second case. Indeed, the value k = 1,4 = 0 in 
the second case is ruled out by the condition that s is a prefix of xı. Thus, there is a 
crossing edge (s, t) in the first case, and an extending edge (s, t) in the second case. 

Assume n > 1 and sy1:-- yet = 41 ++- Xp. Since t Æ £p one of these words is a proper 
suffix of the other. Suppose first that t is a proper suffix of £k, and set x, = ut. Then 
there is an edge from u to t in Gx and moreover sy1--- Ye = %1 °°: k—1U. If k = 1, then 
s is a proper prefix of u, otherwise s remains a proper prefix of xı. Thus the induction 
applies and there is a path from s tu u of length n — 1, whence a path of length n from 
sto t. Assume next that x, is a suffix of t and set t = uzg. This is defines an extending 
edge (u, t). Thus sy1--: ygu = z1 :--zk—1. Since the left-hand side is not empty, s is a 
prefix of xı. The conclusion again follows by induction. 

If the double factorization is sy,--- ye = z1 -+ £kt, then since s is a proper prefix of 
the right-hand side, one has £ > 0. 

If ye is a proper suffix of t, then t = uy, for some u and there is an extending edge 
(u,t). Replacing t by uye gives sy,---ye_1 = %1--: au. Either s is a prefix of xı, or 
k = 0, and then s is a proper prefix of u if 2 > 1 or s = u if £ = 1. In the first case, 
there is a path from s to u, in the second case there is just the edge (s, t). 

Finally, suppose that t is a proper suffix of ye. Then yg = ut and thus there is a 
crossing edge (u,t). Next, sy1 <+- u = %1---2,, SO k > 1 and s remains a prefix of z1. 
There is again a path from s to u of length n — 1 by induction. This completes the 
proof. n 


THEOREM 8.2 A set X of nonempty words is a code if and only if there is no path in 
its prefix graph from a vertex in X to a vertex in X. 


Proof. Assume there is a path from s € X to t € X in the prefix graph Gx. Then = 

there exists a disjoint double factorization of one of the forms described in Lemma h it 

In both cases, this gives a double factorization of a word as a product of words in X. 
Conversely, assume that X is not a code, and consider a shortest word w in X* that 

has two distinct factorizations 


W = Titt In = Y1: Ym 


with 71,...,2%n,Y1,---;Ym in X. We may assume that zı is a proper prefix of y1. Then 
there exists a path from zı to ym of length m +n — 2 in Gy. f 


Given a finite graph G, many properties of G can be checked in linear time with 
respect to the size of G, where the size is the total number of vertices and edges of 
G. Among these properties are the existence of cycles, the existence of paths between 
distinguished sets of nodes, and so on. All properties described in the previous section 
are of these kind. This requires to estimate the size of the graph Gx of X. 


PROPOSITION 8.3 Let X be a finite set of words with n elements, and let N = X` „ex |z| 


be the sum of the lengths of the words in X. The prefix graph Gx has at most N vertices 
and at most nN edges. 
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Proof. The vertices of Gx are the nonempty prefixes of words in X; there are at most 
N — 1 of them. Next, consider a vertex t and an edge (s,t) entering t. If (s,t) is a 
crossing edge, then st € X is longer that t, and if t = sx for some x € X, then z is 
shorter than t. So a word x in X either contributes at most one crossing edge, or it 
contributes at most one extending edge. So the total number of edges entering t is at 
most n, and the total number of edges in Gy is at most nN. n 


COROLLARY 8.4 Given the prefix graph Gx of a set X ofn words of total length N, it 
can be checked in time O(nN) whether X is a code. 


Proof. This is a direct consequence of the previous discussion. n 


It remains to show how to construct the prefix graph Gx of a finite set X in linear 
time with respect to its size, that is with respect to nN, where n is the number of 
words in X, and N is the sum of the lengths of the words in X. 

The construction is in three steps. First, a simple automaton recognizing X is con- 
structed. This automaton is deterministic but not complete, and has the shape of a 
tree. Such an automaton is usually called a trie. The vertices of Gx are among the 
states of this automaton. Next, the automaton is converted into what is called a pat- 
tern machine. This is done in equipping the trie with a failure function. The role of 
this function is to provide, in the case a transition does not exist for some letter in 
some state, another state where one can look for a possible transition. As a result, the 
pattern matching machine recognizes, with the aid of the failure function, the set A* X 
of words ending in a word in X 

These two preliminary steps are used, in the final step, to compute efficiently the 
edges of the graph Gx. 

Given a finite set X of words over the alphabet A, the trie of X is the automaton 
whose set of states is the set P of prefixes of words in X. The initial state is the empty 
word, the end states are the words in X. The next state function is defined for p € P 
and a € A if and only if pa is in P, and then p- a = pa. 

The trie of X can be constructed very simply by inserting the words of X into a tree 
that is initially reduced to the empty word. 


TRIE(X) 
1 T — NEW AUTOMATON() 
2 forzxexX do 


3 pre 
4 for i — 1 to |x| do 
5 a — x{i| 
6 if p - a exists then 
7 p— p-a 
8 else q — NEW STATE() 
9 p-as q 
10 p—q 
11 SETTERMINAL(p) 
12 return T 
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This algorithm clearly computes the trie in time O(N), where N is the sum of the 
lengths of the words in X. 


pe 
EXAMPLE 8.3 The trie of X = {a, bb, abbbba, babab} is given in Figure EE 7 


oo OF OO 


OG HO GO 7-0 


Figure 1.14: The trie of X = {a, bb, abbbba, babab}. Viewed as an automaton, it accepts 
words in X. 


Given a finite set X of words over the alphabet A, the failure function is intended to 
be used when the next-state function p-a is undefined in the trie of X. It gives a state 
q where a new trial for the computation of the next state should be started. 

The failure function f of X is defined on the set of nonempty prefixes of X. For 
pé€P,p#e, f(p) is the longest suffix of p which is in P. For the empty word, 
fle) =e. 

The pattern matching machine of X is the automaton derived from the trie of X by 
extending the next-state function on P by 


pa if pa € P, 
pa= i 
f(p)-a otherwise. 


Moreover, the state p is terminal if f(p) is terminal. The function COMPUTEFAIL- 
URE(T) computes the failure function for the trie T. 


COMPUTEFAILURE(T) 
1 f(e)=e 
2 F — NEW QUEUE() 
3 for a € A such that € -a is defined do 
4 fle-a)ce 
5 ADD(F,€-a) 
6 while F 4@ do 
7 p — GET(F) 
8 if ISTERMINAL(f(p)) then 
9 SETTERMINAL(p) 
10 for a € A such that p -a is defined do 
11 q — f(p) 
12 while q : a is undefined do 
13 q= f(q) 
14 f(p-a)—q-a 
15 ADD(F, p-a) 
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The pattern matching machine is obtained by construction first the trie, and then 
the failure function. 


Figure E 84 The pattern matching machine of X = {a, bb, abbbba, babab} is given in 


Figure ; 
Q000 


rO OOO 


Figure 1.15: The pattern matching machine of X = {a, bb, abbbba, babab}. Viewed as 
an automaton, it accepts words in A*X. Its accepting states are in gray. The failure 


function is represented by dotted edges. 


A state p is terminal for the pattern matching machine if it is a word in A*X. It 
appears useful to know the longest suffix of the state p that is in X. Call this o(p). 
The function ø is undefined on non terminal states, and for terminal states, is is given 
by 


_ Jf) if f(p) isin X, 
(p) = ‘i f(p)) otherwise. 


This shows that, provided we remember those states that are in X, is is quite easy, 
and linear with respect to the number of states, to compute the function o. 

We are now ready to compute the edges of the graph Gx. Each word x in X may 
produce several crossing edges (s,t). This is a crossing edge provided the suffix t is 
also a prefix of a word in X. All these suffixes are enumerated by the failure function. 
Thus one gets the following function for computing the crossing edges: 


CROSSINGEDGES(X ) 
1 forxe xX do 
2 t— f(z) 
3 while t 4 £ do 
4 s at! 
5 ADDCROSSINGEDGE(s, t) 
6 t— f(t) 


The only tricky line is the computation of the vertex corresponding to the word «t~!. 
This may be done by maintaining, for each x in X, an array of pointers to the vertices 
of its prefixes, indexed by there length. So, from the length of x and the length of t 
one obtains the length of s, thus s in constant time. 

The computation of extending edges is quite similar. Given a suffix t, we look for all 
suffixes x of t. Each of these suffixes gives an extending edge (s,t), with sz = t. To 
loop through the suffixes of t which are in X, one iterates the function ø. Thus the 
function is 
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EXTENDINGEDGES(X ) 
1 for tterminal states do 
2 xz — o(t) 
3 while x Æ £ do 
4 ai! 
5 ADDEXTENDINGEDGE(s, t) 
6 Le a(x) 


Again, the tricky point is the computation of s = ta~!. Do do this, one maintains for 
each vertex p a pointer to the longest word in X for which p is a prefix. In the present 
case, s is a prefix of t, so they share the same longest word in X, and the trick of the 
array used previously applies again to give the vertex of s in constant time. 

Altogether, the following function computes the prefix graph of the set X. 


PREFIXGRAPH(X) 
1 T — TRIE(X) 
2 COMPUTEFAILURE(T) 
3 CROSSINGEDGES(X ) 
4 EXTENDINGEDGES(X ) 


PROPOSITION 8.5 Given a set X of n words over some alphabet A, of total length 
N = $ex |z|, the prefix graph Gx can be constructed in time and space O(nN). 


9 Exercises 


g section1.1 
Section 


1.1 Let n > 1 be an integer. Let I,J be two sets of nonnegative integers such that 
for i, i € I and j}, f} € J, 
i+j=ť +j modn 
implies i = i', j = j’. Let 
Y = {aba | ic I,j € J} 
and X = Y Ua”. Show that X is a code. 


P section1i.2 
Section 


.2.8 
2.1 Show directly (that is without using Theorem at a set X = {x,y} is a code 
if and only if x and y are not powers of a single word. 


2.2 Let K be a field and A an alphabet. Denote by K/A) the subalgebra of K((A)) 
formed by the series o € K((A)) such that (o, w) = 0 for all but a finite number of 
words w € A*. Let X C At be a code and 8 : B* — A* be a coding morphism for X. 
Extend 8 by linearity to a morphism from K(B) into K(A). Show that 6 defines an 
isomorphism between K(B) and the subalgebra of K(A) generated by the elements of 
X. 
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2.3 Show that a submonoid N of a monoid M is stable if and only if for all m,n € M 
we have 
nm,n,mn EN > mEN. 


2.4 Let M be a commutative monoid. Show that a submonoid of M is stable if and 
only if it is biunitary. 


2.5 For X C A? let Y be the base of the smallest right unitary submonoid containing 
X. 
(a) Show that Y c (Y+)! X. 
(b) Deduce that Card(Y) < Card(X), and give an example showing that equality 
might hold. 


2.6 Let X be a subset of At. Define a sequence (Sn)n>0 of subsets of A* by setting 
So=X*, Sho = [S7 Sn 95,5, T. 


Set S(X) = UnsoSn. Show that S(X) is the free hull of X. Show that when X is 
recognizable, the free hull of X is recognizable. 


2.7 . Let M be a submonoid of A* and let 
X =(M—1)-(M-1)? 


be its minimal set of generators. Show that X is recognizable if and only if M is 
recognizable. 


2.8 Let M be a monoid. Show that M is free if and only if it satisfies the following 
conditions: 
(i) there is a morphism à : M — N into the additive monoid N such that \~!(0) = 1, 
(ii) for all x, y,z,t E€ M, the equation xy = zt holds if and only if there exists u € M 
such that cu = z,y = ut or x = zu, uy = t. 


. ction1.3 
Section 


3.1 Let X be a subset of At such that XN XXT =. Define a relation p C A* x A* 
by (u,v) € p if and only if there exists x € X* such that 


uzv E X, urAl, wFl, rv Æl. 


Show that X is a code if and only if (1,1) ¢ p*, where pt denotes the transitive closure 
of p. 


. section1.3bis 
Section 


4.1 Show that a submonoid M of A* is recognizable and free if and only if there exists 
an unambiguous trim finite automaton A = (Q, 1,1) that recognizes M. 
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. section1.4 
Section 


5.1 Let D be the Dyck code over A = {a,b}. 


(a) Show that the following equality holds 
D = aD%b + bDja 


with Da = DnaA*b, Dy = D N bA*a. 
(b) Derive from the previous equality another proof of 


d(t) =1— y1- 4pqt?, 


-4.5 
where d(t) is the series defined in Example bot 


5.2 Let n > 1 be an integer and J, J be two subsets of {0,1,...,n — 1} such that for 
each integer p in {0,1,...,n — 1} there exist a unique pair (7,7) € I x J such that 


p=it+tjmodn. 


Let 
V ={i+j-n|icIl, jeJ, i+j>n}. 


For a set K of integers, let 
a = {a" | kE K}. 


Let X C {a,b}* be the set 
X =a! (ba" "ba? Ua". 


Show that X is a maximal code. 


g section1.5 
Section 


6.1 Show that the set X = {a°,b, ab, ba”, aba? } is complete and that no proper subset 
of X is complete. Show that X is not a code. 


6.2 Let p be a prime number. Let I,J be two subsets of {0,1,...,p — 1}, both 
containing 0 such that for each integer q there exists at most one pair (i,j) € I x J 
such that q = i + j(modp). 

(a) Show that X = a? U {atb | i € I} U {bal | j € J} is a code. 
(b) Show that if Card() > 2 and Card(J) > 2, then X is not contained in any finite 
maximal code. 


6.3 Let M be a monoid. Let F be the family of subsets of M which are two-sided 
ideals of M or empty. 
(a) Show that there is a topology on M for which £F is the family of open sets. 
(b) Show that a subset P of M is dense in M with respect to this topology if nd on ie 
only if F(P) = M, i.e., if P is dense in the sense of the definition given in Section jetons 
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5.2 
exo1.5.4| 6.4 With the notations of Proposition 6.4, and V = A* — A*yA*, show successively 
that 


A*= (V yY = (U y) (X “yU yY) 
= (U yV +U yÊ YU yY. 


(Use the identity (o + T)* = r*(or*)* = (o*rT)“a* for two power series 0,7 having no 
constant term). Derive directly from these equations the fact that Y is a code and that 
Y is complete. 


6.5 Show that each recognizable code is contained in a maximal recognizable code. 


6.6 Show that each thin code is contained in a maximal thin code. 


A lsection1.6 
Section 


7.1 Let y : A* — G bea morphism from A* onto a group G. Let H be a subgroup of 
G and let X the group code defined by X* = ~~ !(H). Show that X is indecomposable 
if and only if H is a maximal subgroup of G. 


7.2 Show that any code X = {x,y} with two elements is composed of prefix and 
suffix codes. 


7.3 Show that the code X = {a, aba, babaab} is not obtained by composition of prefix 


and suffix codes. Show that it is contained in the finite maximal code Y given by 
Y — 1 = (1 +b + baba(1 +a +b))\(a+b-— 1)(1 + ba). 


-Obis.4 
Show that Y belongs to the family of finite maximal codes defined in Exercise kk fit i = 


10 Notes 


The concept of a code originated in the theory of communication initiated by C. Shan- 
non in the late 1940s. In this framework, the development of coding theory lead to a 
detailed study of constant length codes in connection with problems of error detection 


and correction. An account of this research can be found in McWilliams and Sloane 
(1977) or (L983). A classical book on information and communication theory 
is [AsH (1965). See also Csiszar and Körner, Janod (981) and McEliece] (L977). 


Variable-length codes were investigated in depth for the first time by 
(1955) and also by Gilbert and Moord (1959). The direction followed by Schützenberger 


consists in linking the theory of codes with classical noncommutative algebra. The 
results presented in this book represent this point of view. An early account of it can 


be found in (966). 

The notion of a stable subm noid appears for the first time in ({L955)) 
which contains Proposition Bathe same result is also give jn Bhevrin (L960), 
and (1965). A detail q Rroof of Proposition Bape oars in (L972). 


The defect theorem (Theorem as been proved in several formulations in 
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. Some generalizations 
are discussed in 1979). For  ręlated o see T (L976). 

The test for codes given in err: 3 Sees back to (L953) and is 

in fact usually known as the Sardinas and Patterson algorithm. The proof of correctness 

is surprisingly involved and has motivated a number of papers (L963), 

(964), (L967), and (L976). The design of an efficient 

algorithm is described in (L976). See also (L982) and 

(1984). The problem of testing whether a recognizable set is a code is a 

special case of a well-known problem in automata theory, namely testing whether a 

given rational expression is unambiguous. Standard decision procedures exist for this 

question see Eilenberg] (L974) and (1974). These techniques will be used in 

Chapter VI e connection between codes and rational expressions has been pointed 

out in (1967). Further, a characterization of those codes whose coding 


morphism preserves the star height of rational expressions is given in Hashiguchi and 
(19764). 


ction1.4 -4.3 
The results of Section Fi n wel T know in information theory. Corollary patwith its 


converse stated in Theorem are known as the Kraft-McMillan theorem (McMillan 


(L956). 
tion1.5 
The main results of Section i rea (E955). Our ee is 


slightly more general. Proposition and Exercise [6.9 are due to | e cht and 


. They eam ae a Question of |Restivol Restivo] (£977). Theorem 6.13 pe in 


is a special case of a construction due to Restivo 


), Exercise Es 1s kn. as Levi’s on pa evi 
(L975). Exercise icin Mestia Restivo] (fl 
arnt a class of =e which are not contained in any finite maximal code. ae 
results in this direction can be found in (L985). ee 
The construction of a trie equipped with a failure function in Section Rf fottows (Aho 
(1975). The resulting structure is called the pattern matching machine. 
The presentation of the algorithm follows closely the description given in 
(1984), see also (Capocelli and Hoffmann}, [[985). These papers contain the transcrip- 
tion to prefixes of the implementation of (Apostolico and Giancarlo} {1984). Similar 
implementation to (Hoffmann, |1984)) are given in 1995). The 
implementation proposed in ([Rodeh} L982) gives the same bounds but is more involved. 
It is based on the suffix tree, that is a compact tree representing all suffixes of a finite 
set of words. 
a exact SN of testing unique decipherability is still unknown, see (Galil| 
EN „for discussion and partial results. 
Lue = (L996). It is a counterexample to a conjecture 
in asserting that every three-word code is composed of prefix 
2 eS suffix ae = is an known whether any three-word code is contained in a finite 


(L972), 1976), and 


maximal code. 
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Chapter II 


Prefix Codes 


0 Introduction 


Undoubtedly the prefix codes are the easiest to construct. The verification that a 
given set of words is a prefix code is straightforward. However, most of the interesting 
problems on codes can be raised for prefix codes. In some sense, these codes form a 
family of models of codes : frequently, it is easier to gain intuition about prefix codes 
rather than general codes. However, we can observe that the reasoning behind prefix 
codes is often valid in the general case. 

For this reason we now present a chapter on prefix codes. In the first section, we 
comment on their definition and give some elementary properties. We also show how 
to draw the picture of a prefix code as a tree (the literal representation of prefix codes). 

In Section Ø, a construction of the automata associated t refix, codes is given. 
These automata are deterministic, and we will see in Chapter PRS to extend their 
construction to general codes. 

The third section deals with laximal prefix codes. Characterizations in terms of 
completeness are given. Section eens the usual operations on prefix codes. Most 
of them have an easy interpretation as operations on trees. ; 

ction2.5 
An important family of prefix codes is introduced in Section FP “Ttey have many com- 


binatorial properties which illustrate the no ions, ; presented previously. The synchro- 
fi act, this notion will be generalized 


nization of prefix codes is d ned y Section 

to general codes in Chapter te a the wie onadio with groups will be established. 
The notion of measure of a prefix, code, can be amplified by a definition of average 
length. This is done in Section i where a combinatorial equivalence of average length 
is given. 


1 Prefix codes 


This introductory section contains equivalent formulations of the definition of a prefix 
code together with the description of the tree associated to a prefix code. We then 
show how any prefix code induces in a natural way a factorization of the free monoid. 
Of course, all results in this chapter transpose to suffix codes by using the reverse 
operation. 
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It is convenient to have a shorthand for the proper prefixes (resp., proper suffixes) of 
the words of a set X. For this we use 


RAS X(A*)! and A7 X = (A)! X. 


A subset X of A* is prefix if X N XAT = 0. If X contains t gmpty word 1, then 
X = {1}. In the other cases, X is a code (Proposition [fl.4)- There is a series of 
equivalent definitions, all of which will be useful. 


PROPOSITION 1.1 For any subset X of A*, the following conditions are equivalent: 
(i) XNXAT=6, 


) 
(iii) XA, X, XAT are pairwise disjoint, 
) if x, zu E€ X, thenu=1, 
) if xu = x'u' and z,x' € X, then x = x' andu =w . n 
The following proposition can be considered as describing a way to construct prefix 
codes. It also shows a useful relationship between prefix codes and right ideals. 


PROPOSITION 1.2 
1. Let L be a subset of A* , and let 


X =L\LA+. (2.1.1) 
Then X is prefix, and it is a code if and only if 1 ¢ L. Moreover 
XA* = LA*, (2.1.2) 


that is, X and L generate the same right ideal. 

2. Let X be a prefix set, and let M be a subset of A* such that X A* = M A*. Then 
X =M\ MA. In particular X C M, and X is the minimal set of generators 
of the right ideal M A*. 


Proof. 1. From X C L, it follows that X At C LAT, whence XN X At c XNLAtT =Í. 
Further 1 € X if and only if 1 € L. This proves the first statement. Next, we claim 
that 

ic XA". (2.1.3) 


Indeed let u € L. If u € X, then u € X A*. Otherwise u € LAT, whence u = u'w for 
some u’ € L, w € At. Arguing by induction on the length, we get that u’ € XA*. 


Thus, u also is jn X A*. 
Fro Hes it follows that LA* C X A*. The reverse inclusion is clear. This proves 


2. Let x be an element of X. Then z = mu for some m € M, u € A*. Similarly, 
m = x'v for some x’ € X, v € A*. It follows that x = 2/vu, whence vu = 1, since X 
is prefix. Thus X C M. Next (XA*)A = (M A*)A, showing that XAt = MA”. This 
implies that X C M \ MA*. To show the converse inclusion, let m € M \ MA. Then 
m = xu for some x € X, u € A*. Assuming u Æ 1, it follows that m € XAT = MAT, 
which is not the case. Thus, u = 1 and m € X. This completes the proof. n 


The set X = L \ LA® is called the initial part of L or also the base of the right ideal 
LA*. 
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COROLLARY 1.3 Let X and Y be prefix subsets of A*. If X A* = Y A*, then X =Y. 


EXAMPLE 1.1 Let A = {a,b} and let L = A*aA* be the set of words containing at 
least one occurrence of the letter a. Then 


X = L\ LAŤ =b*a 


EXAMPLE 1.2 Let A = {a,b} and let X = A*ab\ A*abA*. Clearly X is a prefix code. 
It is the set of words ending with ab that do not contain another occurrence of the 


factor ab. Thus, X = b*a*ab. This code, as does k Brevious one, belongs to the 
tion2. 


family of semaphore codes that are studied in Section 


We now give a useful graphic representation of codes, especially of prefix codes over 
an alphabet with 2 or 3 letters. It consists of associating a tree with each prefix code 
in such a way that the leaves of the tree represent the words in the code. 

First, we associate an infinite tree with the set A* of words over an alphabet A 
as follows. The alphabet is totally ordered, and words of equal length are ordered 
lexicographically. Each node of the tree represents a word in A*. Words of small 
length are to the left of words of greater length, and words of equal length are disposed 
vertically according to lexical ordering. There is an edge from u to v if and only if 
v = ua for some jer a The tree obtained in this way is the literal representation 
of A* (see Figs. pe poe 


aaa 
aa 
a 
ab a 
ae 
b = sa 
bb a 
bbb — 


Figure 2.1: The literal aiin of {a,b}*. 


To a given subset X of A* we associate a subtree of the literal representation of A* 
as follows. We keep just the nodes corresponding to the words in X and all the nodes 
on the paths from the root to these nodes. Nodes corresponding to words in X are 
marked if neces ‘sary. The tree obtained in this way is the literal representation of X. 

eae 2-0 


Figures 2.3-2.6] give several examples. 
p 


It is easily seen that a code X is prefix if and only if in the literal representation of 
X, the nodes corresponding to words in X are all leaves of the tree. 
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Figure 2.2: The literal representation of {a, b,c}*. 


a 
baa 
ba 


Figure 2.3: Literal representation of X = {a, ba, baa}. 


< 


Figure 2.4: Literal representation of X = {aa, ab, baa, bb}. 


= 


Figure 2.5: Literal representation of X = {a, b, ca, cbba, cbcc}. 


a 


Figure 2.6: Literal representation of X = a* 


The advantage of the literal representation, compared to simple enumeration, lies 
in the easy readability. Contrary to what might seem to happen, it allows a compact 
representation of relatively big codes (see Fig. pa 

After this digression, let us state the factorization theorem announced at the begin- 
ning of this section. 
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THEOREM 1.4 Let X be a prefix code over A and let 
R= A*\ XA*. 
Then R is prefiz-closed and nonempty. Further, 


1+RA=R+X, 
A*=X*R. 
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(2.1.4) 
(2.1.5) 
(2.1.6) 


Conversely, if R is a prefix-closed subset of A*, then X = RA\ R is a prefix code. If 


R £0, then it is the unique prefix code such that R = A* \ X A*. 


Proof. If a word w has no prefix in X, then each of the prefixes of w shares this property. 
Thus, R is prefix-closed. The set R is De a en X # {1}. The product of X 


and A* is unambiguous in view of Proposition v). Thus, XA* = X A* and 


R =A\XA =A*-XA*=(1-X)A*. 


= 


Figure 2.7: A code with 26 elements. 


From this equation and 
(1 = A)? = A* ’ 


we get, by multiplying (ray by 1-A, 
R(1-A)=1-X, 

which is formula Ey This also gives 
R-RA=1-X, 


.5 
from which ollows. Finally, note that 


Pee Cae 9 a 


(2.1.7) 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


104 II. Prefix Codes 


1 -T 
by Proposition MAI Multiplication of (Era by X * gives 


XR- ed". 


This proves (ae 
Conversely, let R be a prefix-closed subset of A*. If R = @ then X = RA\ R= f is 
a prefix code; thus we may assume R nonempty . Let X = RA \ R. Then X is prefix. 
Indeed if x € X, then x = ra for some r € R, a € A. Each proper prefix u of x is a 
prefix of r, thus, it is in R and therefore not in X. This shows that X N XA- =f. 
With R being prefix-closed, 1 belongs to R and 


X=RA-R+1. 


Indeed the product RA is unambiguous, and furthermore 1 ¢ RA. It successively 
follows from the formula that 


X-1=R(A-1), R=(1-X)A*=A*—XA*=A"\ XA". 


The last equality holds because X is prefix. Assume finally that there is a prefix code 
Y such that 
R=A*\ YA =A*\ XA. 


243 
Then Y A* = X A*, whence Y = X by Corollary fea this proves uniqueness. n 


Note the following combinatorial interpretations of formulas Ei ana (tay The 
first says that a word in R followed by a letter is either in R or in X and that each 
word in X is composed of a word in R followed by a letter. The second formula says 
that each word w € A* admits a unique factorization 


W = T1T2'''LnU, 1,...,% EX, UER. 


Figure 2.8: A prefix code. 


Figure 2.9: Another prefix code. 
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8 
EXAMPLE 1.3 Let X = {a, baa, bab, bb} C A* be the code represented in Fig. ES Here 
R={l1,b,ba} = XA, and X —1=(1+6+56a)(A — 1). The vn Fess R and 


X A` characterizes maximal prefix codes, as we will see in Section f. 


9 
EXAMPLE 1.4 Let X = (b?)*{a?°b, ba}, as given in Fig. Bor Here R = Rı U R2, where 
Rı = XA~ = (bÞ)*(1 UaU bU a?) is the set of pro ey prefixes of X and Rz = 
XAt — X — X A` = (b*)*(abA* U a? A*). Thus Eq. Pow becomes 


X -1=(0*)*(1+a+b+a7+abA*+a°A*)(A —1). 


2 The automaton of a prefix code 


The literal representation yields an easy method for verifying whether a word w is in X* 
for some fixed prefix code X. It suffices to follow the path starting at the root through 
the successive letters of w. Whenever a leaf is reached, the corresponding factor of w 
is split away and the procedure is restarted. 

We will consider several automata derived from the literal representation and relate 
them to the minimal automaton. The study which is initiated here will be generalized 
to arbitrary codes in Chapter pie particular case of prefix codes i interesting in 
itself because it is e Origin of most of the general results of Chapter pee 

Recall (Chapter Fi that for any subset X C A*, we denote by A(X) the minimal 
deterministic automaton recognizing X. 


PROPOSITION 2.1 Let X be a subset of A*. The following conditions are equivalent: 
(i) X is prefix. 
(ii) The minimal automaton A(X) is empty or has a single final state t and t- A = 0. 
(iii) There exist a deterministic automaton A = (Q,i,T) recognizing X with T-A = 0. 


Proof. (i) = (ii). Suppose that X is nonempty. Set A(X) = (Q,i,T). First, we claim 
that for q € T , we have {w € A*|q-w eT} = {1}. Indeed let x € X and w € A* be 
words such that i- x = q (remember that q € T) and q: w € T. Then zw € X, whence 
w = 1. This shows the claim. 

Thus, two final states are not separable and from the minimality of A(X), it follows 
that A(X) has just one final state, say t. Assume that t- A #4, and that t- a = p for 
some letter a € A and some state p. Since p is coaccessible, we have p -v = t for some 
v € A*. Thus t- av = t, whence av = 1, a contradiction. 

(ii) = (iii) is clear. 

(iii) > (i). From T - A = 9, it follows that T - At =. Thus, if x € X, and w € AT 
then i- zw = and zw ¢ X. Thus XN XAt =9. / 


It is easy to construct an automaton for a prefix code by starting with the literal 
representation. This automaton, call it the literal automaton of a prefix code X, is the 
deterministic automaton 


A=(XA UX,1,X) 
defined by 


ua ifuae XA UX, 
Ua 
Ø otherwise. 
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Since X A~ UX is prefix-closed, we immediately see that 1-u € X if and only if u € X, 
that is L(A) = X. The pictorial representation of a literal automaton corresponds, of 
course, to the literal representation of the code. 


EXAMPLE bag code X = {ab, bab, bb} over A = {a,b} "e: N literal representation 


given in Fig. and the literal automaton given in Fig. 


Figure 2.10: (a) Literal representation of X, (b) Literal automaton of X. 2_ 


The literal automaton A of a prefix code X is trim but is not minimal in general. 
For infinite codes, it is always infinite. Let us consider two states of A. It is equivalent 
to consider the two prefixes of words of X, say u and v, leading to these states. These 
two states are inseparable if and only if 


u` X =v 'X. 


Note that this equality means on the literal representation of X that the two subtrees 
with roots u and v, respectively, are the same. This yields an easy procedure for the 
computation of the minimal automaton: first, all final states are labeled, say with label 
0. If labels up to 7 are defined we consider subtrees such that all nodes except the roots 
are labeled. Then roots are labeled identically if the (labeled) subtrees are isomor- 
phic. Taking the labels, a: States, we obtain the minimal automaton. The procedure is 
described in Examples 2 ae 


2.1 2.1 
EXAMPLE 2 i continued) The three terminal states are inseparable by Proposition 2 i 


The states a and ba are inseparable since a~!.X = (bqa)~1X = b. No other relation exists. 
Thus the minimal automaton is as given in Fig. 


EXAMPLE 2.2 The literal automaton of X = (b?)*(a7b U ba) is given in Fig. Zel 


Clearly the final states are equivalent, and also the predecessors of final states and 
their predecessors. On the main diagonal, however, tates are only equivalent with 
a step 2. This gives the minimal automaton of Fig. 
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Figure 2.11: The minimal automaton of X = {ab, bab, bb}. 


Figure 2.12: A literal automaton. 


Figure 2.13: Minimal automaton corresponding to Figure PTA. 


EXAMPLE 2.3 In pz the labeling procedure has been carried out for Bis” 


element code of Fig. 2-7, This gives the subsequent minimal automaton of Fig. 


We now consider automata recognizing the submo oid X * generate by a prefix code 
X. Recall that X* 4s pight unitary (Proposition EB Eroposition 1s the analogue 
of Proposition BP.I 


PROPOSITION 2.2 Let P be a subset of A*; the following conditions are equivalent: 
(i) P is a right unitary submonoid. 
(ii) The minimal automaton A(P) has a unique final state, namely the initial state. 
(iii) There exists a deterministic automaton recognizing P having the initial state as 
the unique final state. 


Proof. (i) = (ii). The states in A(P) are the nonempty sets u—!P, for u € A*. Now if 
u € P, then u-!P = P because uv € P if and only if v € P. 
Thus, there is only one final state in A(P), namely P which is also the initial state. 
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Figure 2.14: The computation of a minimal automaton. 


Figure 2.15: A minimal automaton. 


(ii) = (iii) is clear. 

(iii) > (i ). Let A = (Q,i,i) be the automaton recognizing P. The set P then is a 
submonoid since the final state and the initial state arc the same. Further let u, uv € P. 
Then i-u =i and i- uwv = i. This implies that i -v = i because A is deterministic. 
Thus, v € P, showing that P is right unitary. n 


If A = (Q,i,T) is any deterministic automaton over A, the stabilizer of a state q is 
the submonoid 
Stab(q) = {w € A*|q:-w=q}. 
PROPOSITION 2.3 The stabilizer of a state of a deterministic automaton is a right 
unitary submonoid. Every right unitary submonoid is the stabilizer of a state of some 


deterministic automaton. 
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. . . e pe .2.2 
Proof. It is an immediate consequence of the proof of Proposition Eo n 


This proposition shows the importanç gf ight unitary submonoids and of prefix 
codes in automata theory. Proposition Pa nts a method for deriving the minimal 
automaton A(X*) of X* from the minimal automata A(X) of the prefix code X. 


PROPOSITION 2.4 Let X be a nonempty prefix code over A, and let A(X) = (Q, i,t) 
be the minimal automaton of X. Then the minimal automaton of X* is 


(Q,t,t) if Stab(i) Æ 1, (242.19. 1 
(Q—i,t,t) if Stab(i)=1. (232.2). 2 


the action of A(X*), denoted by o , is given by 


qoa=q:a forq#t (2.2.3) 
toa=i-a (2.2.4) 


Proof. Let, B = (Q, ,4) be the automaton obtained from A(X), defining the action o 
by and . Then clearly 


L(B) = {w|tow=t} =X". 


A(X") -f 


Let us verify that the automaton B is reduced. For this, consider two distinct states p 
and q. Since A(X) is reduced, there is a word u in A* separating p and q, i.e., such 


that, say 
p-u=t, quFt. (2.2.5) 


It follows that po u = t, and furthermore po v #¢ for all v < u. If qo u Æt, then u 
separates p and q in the automaton B also. Otherwise, there isa, spallest prefix v of u 
such that qov = t. For this v, we have q.v = t. In view of Cae, Au. Thus v < u. 
But then q o u = t and pov Æ t, showing that p and q are separated by v. 

Each state in B is coaccessible because this is the case in A(X). From 1 4 X, we 
have i £ t. The state i is accessible in 5 if and only if the set {w | tow = i} is 
nonempty, thus if and only if Stab(z) 4 1. If this holds, B is the minimal automaton of 
X*. Otherwise, the accessible part of B is its restriction to Q — i. n 


The automaton A(X*) always has the form given by ESR X is finite. In this 
case, it is obtained by identifying the initial and the final state. The classification of 
the prefix codes according to the form of the minimal automaton A(X*) goes through 
the notion of chain. 

A prefix code X is a chain if there exist disjoint nonempty sets Y, Z such that Y U Z 
is prefix and 

X=Y*Z. 


PROPOSITION 2.5 Let X be a nonempty prefix code over A, and let A(X) = (Q, i,t) 
be the minimal automaton of X. The following conditions are equivalent: 


(i) Stab(i) 4 1, 
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(ii) X is a chain, 
(iii) there exists a word u E€ At such that u-!X = X. 


Proof. (i) = (ii). By Proposition Pat Siab(:) is a right unitary submonoid. Its base, 
say Y, is a prefix code which is nonempty because Stab(z) Æ 1. Let Z be the set of 
words defined as follows: z € Z if and only if i- z = t and i- z’ Æ i for all proper 
nonempty prefixes z’ of z. From t- A = @, it follows that Z is a prefix code. Further 
YAZ #0, byi#t. Finally X = Y*Z. It remains to verify that V = Y UZ is prefix. A 
proper prefix of a word in Z is neither in Z nor in Y, the latter by definition. A proper 
prefix w of a word y in Y cannot be in Z, since otherwise i- w = t whence i- y = 0. 
Thus V is prefix and X is a chain. 

(ii) > (iii). Assume that X = Y*Z with V = Y UZ prefix and YNZ = @. Consider a 
word u € Y. The code V being prefix, we have u™tZ = Ø. Thus u™tX = u7!(Y*Z) = 
u` !Y*Z =Y*Z =X. 

(iii) > (i). The automaton A(X) being minimal, the states of A(X) are in bijective 
correspondence with the nonempty sets v™tX, where v runs over A*. The bijection is 
given by associating the state i -v to v-!X.Thus, the equality u`! X = X expresses 
precisely that i-u = i. Consequently u € Stab(i). n 


-2,1 
EXAMPLE 2 i (continued) The minimal automaton of X* is given in Fig. PTA. The 
finite code X is not a chain. 


Figure 2.16: The minimal automaton of X*. 


.2,2 
EXAMPLE ba (continued) This code is a o The maton A(X*) is obtained 
without suppressing the initial state of A(X). See Fig. 


EXAMPLE 2.4 Consider the code X = ba*b over A = {a,b}. Its minimal automaton is 
given in Fig. 


he stabilizer of the initial stat sf 1.The minimal automaton 
A(X*) given in Fie. is derived from formula peg? 


2.4 
A construction which is analogous to that of Proposition P-atatiows us to define the 
literal automaton of X* for a prefix code X. It is the automaton 


Aa ii 
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Figure 2.18: (a) The minimal automaton of X = ba*b, and (b) of X*. 2_18 


whose states are the proper prefixes of words in X, and with the action given by 


ua ifua EXA, 


u-a=į1 ifuaeX, (2.2.6) 


Ø otherwise. 


This automaton is obtained from the literal automaton for X by identifying all fi- 
nal states of the latter with the initial state 1. It is immediate that this automaton 
recognizes X*. 

The following property of rational prefix codes will be appear to be useful later. 


PROPOSITION 2.6 For any rational prefix code X over A, there exists an integer N 
such that the length of any strictly increasing sequence of suffixes of words of X for the 
prefix order is bounded by N. 


Proof. Let A = (Q,i,T) be a finite automaton with N states recognizing X, and 
assume there is a sequence of N + 1 suffixes so,...,sy of words of X such that each 
sj is a proper prefix of sj+1. Each s; is the label of a path from some state q; into a 
final state t; in A. Moreover there is, for each j, a word p; that is the label of a path 
from 7 to qj. Note that p;q; is in X for each j. By the definition of N, there exist j, k 
with 0 < j < k < N such that qj = qx. Thus both p;s; and pjs, are in X, and pjs; is 
a proper prefix of p;s;,, contradicting the fact that X is prefix. n 


EXAMPLE 2.5 Consider the prefix code X = A*aba \ Ataba over A = {a,b}. The 
sequences of maximal length of strictly increasing sequences of suffixes, for the prefix 
order, are €,a,a”aba with n > 1. Another sequence is £, ba. 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


112 II. Prefix Codes 


3 Maximal prefix codes 


A subset X of A* is a maximal prefix set if it is prefix and if it is properly contained in 
no other prefix subset of A* , that is, if X c Y C A* and Y prefix imply X = Y. 

As for maximal codes, a reference to the underlying alphabet is necessary for the 
definition to make sense. 

The set {1} is a maximal prefix set. Every other maximal prefix set is a code. A 
maximal code which is prefix is always maximal prefix. The converse does not hold: 
there exist maximal prefix codes which are not maximal as codes. However, under 
mild assumptions, namely for thin codes, we will show that maximal prefix codes are 
maximal codes. 

The study of maximal prefix codes uses a left-to-right oriented version of dense and 
complete codes. 

Let M be a monoid, and let N be a subset of M. An element m € M is right 
completable in N if mw E€ N for some w in M. It is equivalent to say that N meets the 
right ideal mM. A subset N is right dense if every m € M is right completable in N, 
i.e., if N meets all right ideals. The set N is right complete if the submonoid generated 
by N is right dense. The set N is right thin if it is not right dense. Of course. all these 
definitions make sense if right is replaced by left. 

The following implications hold for a subset N of a monoid M: 


N right dense = WN dense 
N right complete = N complete 
N thin = N right thin. 


In the case of a free monoid A*, a subset N of A* is right dense if and only if every 
word in A* is a prefix of some word in N. Thus every (nonempty) left ideal is right 
dense. Similarly, N is right complete if every word w in A* can be written as 


w = Mm- Myp 
for some r > 0, m1,--- , Mr € N, and p a prefix of some word in N. 


PROPOSITION 3.1 For any subset X C A* the following conditions are equivalent: 
(i) XA* is right dense, 
(ii) A* = XA UXUXAT, 
(iii) for all w € A*, there exist u,v € A*, x E€ X with wu = xv. 


Proof. (i) = (iii). Let w € A*. Since X A* is right dense, it meets the right ideal wA*. 
Thus wu = xv for some u,v € A*, and x E€ X. 

(iii) > (ii). If wu = xv, then w € XA~, w € X or w € XA* according to w < zx, 
w=z, CT W >T. 

(ii) > (i). The set of prefixes of X A* is XAT UX UXAT. n 


PROPOSITION 3.2 Let X C A* be a subset that does not contain the empty word. Then 
X A* is right dense if and only if X is right complete. 
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XA~ UX then wu € X for some u € A*. Otherwise w € X At by Proposition 


Proof. Suppose first that X A* is right dense and consider a word w € A*. If $ $ 3i 


Thus, w = rw’ for some x € X, w’ € At. Since x 4 1, we have |w’| < |w|. Arguing by 
induction, w’u € X* for some u in A*. Thus, w is a prefix of some word in X*. 
Conversely, let w € A*, and assume that wu € X* for some u € A*. Multiplying if 
necessary by some word in X, we may assume that wu # 1. Then wu € X+ C XA*. 
| 


23.2 
Note that Proposition Betdoes not hold for X = {1}. In this case, X A* = A* is right 
dense, but X* = {1} is, of course, not. 


THEOREM 3.3 Let X be a prefix subset of A*. The following conditions are equivalent: 
(i) X is a maximal prefix set, 
(ii) X A* is right dense. 
Further, if X # {1}, they are equivalent to 
(iii) X is right complete. 


Observe that the equivalence of prefix maximality and right ore HE giyen in 


this statement is stronger than the result for general codes (Theorem [ff . In the 
general case, the restricti tp thin codes is necessary. 
Proof. By Proposition BS conditions (ii) and (iii) are equivalent. Let us show that (i) 


and (ii) are equivalent. Let u € At. The set X U u is prefix if and only if (X Uu) N 
(X Uu)At = 0, thus, if and only if 


(uN XAT)U(X NUuAt) =9, 


or if and only if u ¢ XA* and u ¢ XA~. Denote by U the set of all words u such that 
X Uu is prefix. Then 


U = A*\ (XAt U XAS). (2.3.1) 

.1 
Clearly X i aximal prefix if and only if X = U; thus, by ES and in view of 
Proposition ae is equivalent to X being right dense. n 


The following corollary appears to be useful. 


COROLLARY 3.4 Let L C At and let X = L\ LAt . Then L is right complete if and 
only if X is a maximal prefix code. 


.3.2 
Proof. L is right al : and only if LAĄ* js right dense (Proposition B from 


X A* = LA* (Proposition [L-2 and Theorem -3 the statement follows. m 


: f 1.4 ; 
We now give the statement corresponding to the part of Theorem Erbi for maximal 
prefix codes. 


THEOREM 3.5 Let X be a prefix code over A, and let P = X A~ be the set of proper 
prefixes of words in X. Then X is maximal prefix if and only if one of the following 
equivalent conditions hold: 


X -1=P(A-!1), (2.3.2) 
1+PA=P+X, (2.3.3) 
A*=X*P. (2.3.4) 
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Proof. Set R = Až yA. If X is maximal prefix, then X A* is right den, e,ayd R=P 
by Proposition BT The conclusion the follows directly from Theorem Pa a versely. 
if X —1=P(A — 1), then by Eq. 


P(A-1)=R(4-1) 
Since A — 1 is invertible we get P = R, showing that X A* is right dense. n 


2 
In the case of a finite m Be code, Eq. (Esai tives a factorization of X —1 


into two polynomials. Formula ( as an interpretation on the literal representation 
of a code X which makes the verification of maximality very easy: if p is a node which 
is not in X, then for each a € A, there must exist a node pa in the literal representation 
of X. ope 

The following statement corresponds to the other part of Theorem and charac- 
terizes those prefix-closed sets which define maximal prefix codes. 


PROPOSITION 3.6 Let R be a nonempty prefix-closed subset of A*, and let X = RA\R. 
Then X is a maximal prefix code if and only if R contains no right ideal. If this the 
case, R= X A`. 


1.4 
Proof. By Theorem EF tis prefix and 
R= A*\ XA*. 


From this, it follows that R contains no right ideal if and only if X A* meets all right 


d als,,i.€. if and only if X A* is right dense. Thus the equivalence follows from Theorem 
Pas. last observation is a consequence of A* = X A* UX A™. n 


We now come to the relation existing between maximal prefix codes and maximal 
codes which are prefix. 


THEOREM 3.7 Let X be a thin subset of At. The following conditions are equivalent: 
(i) X is maximal prefix code, 
(ii) X is prefix and a maximal code, 
(iii) X is right complete and a code. 


Proof. The implication (ii) = (i) is clear. (i) = (iii) follows from oe Batt 


remains to prove (iii) = (ii). Let Y = X — XA*. By Proposition AP = XA", 

Thus Y is right complete. Consequently Y is complete. TePe set Y is also thin, since 

Y C X. Thus Y is a maximal code by Theorem [If rom the inclusion Y C X, we 

have X = Y. n 
3T 

EXAMPLE 3.1 This example shows that Theorem oes not hold without the as- 


sumption that the code is thin. Let $ = ypa!" | u € A*} over A = {a,b} be the 


reversal of the code given in Example [I t is a maximal code vB is right dense, 
whence right complete. However, X is not prefix. From Corollary B-4) it follows that 
Y = X — XA? is a maximal prefix code. Of course, Y # X, and thus, Y is not 
maximal. 
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PROPOSITION 3.8 Let X be a thin subset of At. The following conditions are equiva- 
lent: 

(i) X is a maximal prefix code. 

(ii) X is prefix, and there exists a positive Bernoulli distribution n such that 7(X) = 
1. 

(iii) X is prefix, and 7(X)q = 1 for all positive Bernoulli distributions n. 

(iv) X is right complete, and there exists a positive Bernoulli distribution n such that 
aX = 1, 

(v) X is right complete, and 1(X) = 1 for all positive Bernoulli distributions n. 


3.7 110 
Proof. It is an immediate consequence of Theorem and of Theorem fiers n 


In the previous section, we gave a description of prefix codes by means of the bases 
of the stabilizers in a deterministic automaton. Now we consider maximal prefix codes. 
Let us introduce the following definition. A state q of a deterministic automaton 
A = (Q,i,T) over A is recurrent if for all u € A*, there is a word v € A* such that 
q: uv = q. This implies in particular that q- u Æ Ø for all u in A*. 


PROPOSITION 3.9 Let X be a prefix code over A. The following conditions are equiv- 
alent. 
(i) X is maximal prefix. 
(ii) The minimal automaton of X* is complete. 
(iii) All states of the minimal automaton of X* are recurrent. 
(iv) The initial state of the minimal automaton of X* is recurrent. 
(v) X* is the stabilizer of a recurrent state in some deterministic automaton. 


Proof. (i) = (ii). Let A(X*) = (Q,i,i) be the minimal automaton of X*. Let q € Q, 
a € A. There is some word u € A* such that i-u = q. The code X being right complete, 
uav € X* for some word v. Thus i = i - uav = (q - a) : v, showing that q-a #4 0. Thus 
A(X*) is complete. 

(ii) = (iii). Let q € Q, u € A*; then d = q - u #9 since A(X*) is complete. A(X*) 
being minimal, q’ is coaccessible, and q is accessible. Thus q’: v = q, for some v € A* , 
showing that q is recurrent. 

The implications (iii) > (iv) = (v) are clear. 

(v) = (i). Let A = (Q,i,T) be a deterministic automaton and q € Q be a recurrent 
state such that X* = Stab(q). For all u € A* there is a word v € A* with q- uv = q, 
thus uv € X*. This show, that X is right complete. The set X being prefix, the result 
follows from Theorem 6-3. n 


4 Operations on prefix codes 


Prefix codes are closed under some simple operations. We start with a general result 
which will be used several times. 


PROPOSITION 4.1 Let X, (Yi)icr be nonempty subsets of A*, and let (Xi)icr be a par- 
tition of X. Set 


Z= Lav. 
icl 
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1. If X and the Y;’s are prefix (maximal prefix), then Z is prefix (maximal prefix). 
2. If Z is prefix, then all Y; are prefiz. 
3. If X is prefix and Z is maximal prefix, then X and the Y;’s are maximal prefiz. 


Proof. 1. Assume that z,zu € Z. Then 


z=a2y, zu = zry 


for some i,j € I, x € Xi, y € Yi, £! € Xj, y’ € Y;. From 
ryu = x'y' 


it follows that x = x’ because X is prefix, whence i = j and y = y’. Thus, u = 1 and 
Z is prefix. Assume now that XA* and the Y;A* are right dense. Let w € A*. Then 
ww’ = xv for some w',v € A*, x € X. Let x belong to X;. Since Y;A* is right dense, 
vu’ € Y; A* for some v’ € A*. Thus ww'v’ € X;Y;A*, whence ww'v’ € ZA*. Thus Z is 
maximal prefix. 

2. Let y, yu € Y; and x € X;. Then ry, xyu € Z, implying that u = 1. 

3. From ZA* C XA* we get that X A* is right dense. Consequently X is maximal 
prefix. To show that Y;A* is right dense, let w € A*. For any æ € X;, zw is right- 
completable in ZA*. Thus, xw = zw’ for some z € Z. Setting z = x'y' with x’ € X;, 
y’ € Yj gives cw = 2'y'w'. The code X being prefix, we get x = x’, whence w = y'w’, 
showing that w is in Y;A*. . 


For Card(J) = 1, we obtain, in particular, 


COROLLARY 4.2 If X and Y are prefix codes (maximal prefix), then XY is a prefix 
code (maximal prefix). n 


The converse of this coragll 1% holds only under rather restrictive conditions and will 
be given in Proposition are 


COROLLARY 4.3 Let X C At, andn > 1. Then X is (maximal) prefix if and only if 
X” is (maximal) prefix. 


-4.2 
Proof. By Corollary ZES is maximal prefix for a maxi al, prefix code X. Conversely, 
setting Z = X” = X"-1X, it follows f orp Proposition FHS} that X is prefix. Writing 
Z = XX""1, we see by Proposition EHS that X (and X"~') are maximal prefix if Z 
is. m 


4.3 -4.8 
Corollary is a special case of Proposition Eto be proved later. 


COROLLARY 4.4 Let X and Y be prefix codes, and let X = Xı U Xə be a partition. 
Then Z = Xı U XY is a prefix code and Z is maximal prefix if and only if X and Y 
are maximal prefix. 


Proof. With Y’ = {1}, we have Z = X,Y'U XoY. The result follows from Proposition 
‘because Y’ is maximal prefix. m 


There is a special case of this corollary which deserves attention. It constitutes an 
interesting operation on codes viewed as trees. 
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COROLLARY 4.5 Let X and Y be prefix codes, and x E€ X. Then 
Z=(X\xz)UzY 
is prefix and Z is maximal prefix if and only if X and Y are. n 


The operation performed on X and Y is sketched in Fig. PETS. We now turn to the 
converse operation. 


Figure 2.19: Combining codes X and Y. 2_19 


Z=(X-x)UxY 


PROPOSITION 4.6 Let Z be a prefix code, and let p € ZAT. Then 


Y, =p 'Z and X = Z \ pY, U {p} (2.4.1) 


are prefix sets. Further if Z is maximal prefix, then Yp and X are maximal prefix also. 


. . i ke 1 $: . . . 
Ki operation described in Pi oan be drawn as shown in Fig. Pz. Proposition 


is a special case of the following result. 


X = (Z-pY,) Up 


SS 


Figure 2.20: Separating Z and Yp. 2_20 


PROPOSITION 4.7 Let Z be a prefix code, and let Q be a prefix subset of ZAT. For 
each p E€ ZAT, the set Yp = p 'Z is a prefix code; further 


x =O) (2\ U DY») 
pEQ 
is a prefix set. If Z is maximal prefix, then X and the Yp (p € Q) are maximal prefix. 
Proof. Set Xo = Z \ Upeo PYp, Yo = {1}, Xp = {p}. Then 
Z = Xo% U U XpYp. 
peQ 
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Thus, to derive the result from Proposition fet r suffices to show that X is prefix. 
Let x,xu € X with u € At. These words cannot both be in the prefix set Z nor 

can they both be in the prefix set Q. Since Q C ZAT, we have x € Q, xu € Z. Thus 

u € Y, and xu is not in X. : 


Propositions Eia Eren be used to enumerate some maximal prefix sets. Let us 
illustrate the computation in the case of A = {a,b}. If Z is maximal prefix and Z £ 1, 
then both 

X=alZ, Y=bZ 


are maximal prefix and 
Z=aX UbY. (2.4.2) 


.2 
Conversely, if X and Y are maximal prefix, then so is Z. Thus, Eq. (BEI actines a 
bijection from maximal prefix codes onto pairs of maximal prefix sets. Further 


Card(Z) = Card(X) + Card(Y). 


2 
Let an be the number of maximal prefix sets with n elements. Then by Eq. (ry? 


for n > 2, 
by AKAL. 
k+l=n 


Th sh lated to the Catal b lread tered in Sect ppctiont.4 
ese numbers are Te ate (0) e Uatalan numbers already encountered 1N oectlon 


(see Exercise 
1 2n 
les = — . 
n+l me an 


PROPOSITION 4.8 Let Y,Z be composable codes and X =Y oZ. Then X is a maximal 
prefix and thin code if and only if Y and Z are maximal prefix and thin codes. 


Proof. Ass first that X is thin and maximal prefix. T en, X is right complete by 
Theorem us X is thin and complete. By Proposition R: h Y and Z are thin 
and ope Further Y is prefix (Proposition piai Thus Y (being thin, prefix, 
and complete) is a maximal prefix code. Next X is right dense and X Cc Z*. T ys Z 
is right dense. Consequently Z is a right complete, thin code. By Theorem B. Z is 
maximal prefix. EAT 
Conversely, Y and Z being prefix, X is prefix by Proposition tit 3 and | 
both thin and complete, X is also thin and complete by Proposition [l ~i 
a maximal prefix code. 7 


PROPOSITION 4.9 Let Z be a prefix code over A, and let Z = X UY be a partition. 
Then T = X*Y is a prefix code, and further T is maximal prefix if and only if Z is a 
maximal prefix code. 


Proof. Let B be an alphabet bijectively associated to Z, and let B = C'U D be the 
partition of B induced by the partition Z = X UY. Then 


T=C*DoZ. 
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; : es B 
The code C*D clearly is prefix. Thus, T is prefix by Proposition firey Next, T* = 
1UZ*Y showing that T is right complete if and onl if,4 is right complete. The second 
part of the statement thus results from Theorem Bl n 


ction2.2 
Note that the codes of this proposition are the ps of Section z) We conclude this 


section by the proof of a converse to Corollary 


PROPOSITION 4.10 Let X and Y be finite nonempty subsets of A* such that the product 
XY is unambiguous. If XY is a maximal prefix code, then X and Y are maximal prefix 
codes. 


The following example shows that the conclusion fails for infinite codes. 


EXAMPLE 4.1 Consider X = {1,a} and Y = (a”)*b over A = {a,b}. Here X is not 
prefix, and Y is not maximal prefix. However, XY = a*b is maximal prefix and the 
product is unambiguous. 


Proof Let Z = XY and n = max{|y| | y € Y}. The proof is by induction on n. For 
n = 0, we have Y = {1} and Z = X. Thus, the conclusion clearly holds. Assume n > 1 
and set 

T={yeY|lylJ=n}, Q={qe VA |qANT FD}. 


By construction, T C QA. In fact T = QA. Indeed, let q E Q, a E€ A and let 
x € X be a word of maximal length. Then xq is a prefix of a word in Z, and xqa is 
right-completable in ZA*. The code Z being prefix, no proper prefix of xga is in Z. 
Consequently 

xqav = x'y' 


for some xz’ € X,y' € Y, and v € A*. 
Now n = |ga| > |y'|, and |z| > |x 
qa € Y and T = QA. Now let 


‘|, Thus z = 2’, y’ = qa, v = 1. Consequently 


Y'=(Y\T)UQ, Z'=XY'. 
We verify that Z’ is prefix. Assume the contrary. Then 
ry’ U = xy" 


for some 2,2’ € X, y',y” € Y', u# 1. Let a be the first letter of u. Then either y’ or 
y'a is in Y. Similarly either y” or y”b (for any b in A) is in Y. Assume y’ € Y. Then 
xy’ € Z is a proper prefix of x'y” or x'y”b, one of them being in Z. This contradicts 
the fact that Z is prefix. Thus y'a € Y. As before, xy'a is not a proper prefix of x’y" 
or 2/y"b. Thus necessarily u = a and y” € Y, and we have 


/ yw 
rya=xry 


with y/a,y” € Y. The unambiguity of the product XY shows that x = 2’, y'a = y". 
But then y” ¢ Y’. This yields the contradiction. 

To see that Z’ is maximal prefi serve that Z C Z'U Z'A. Thus ZA* C Z'A* and 
the result follows from Theorem B.3. Finally, it is easily seen that the product XY” is 
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unambiguous: if zy! = s'y” with x,2’ € X, y’,y” € Y’, then either y’,y” € Y \T or 
y' y” € Q, the third case being ruled out by the prefix character of Z. 

Of course, max{|y| | y € Y’} = n — 1. By the induction hypothesis, X and Y’ are 
maximal prefix. Since 


Y = (Y'\Q)UQA, 
-4.4 
the set Y is maximal prefix by Corollary Ft n 


; ; a .10 
It is also possible to give a completely different proof of Proposition EiT usmg the 
fact that, under the hypotheses of the Proposition, we have 7(X)a(Y) = 1 for all 
Bernoulli distributions z. 


5 Semaphore codes 


This section contains a detailed study of semaphore codes which constitute an inter- 
esting subclass of the prefix codes. This investigation also illustrates the techniques 
introduced in the preceding sections. 


PROPOSITION 5.1 For any nonempty subset S of At, the set 
X = A*S \ A*SAT (2.5.1) 
is a maximal prefix code. 


Proof. The set L = A*S is a left ideal, and thus, is right dense. Consequently, L is 
right complete, and by Corollary B.4 the set X = L \ LA+ is maximal prefix. n 


A code X of the form given in Eq. Pais called a semaphore code, the set S being 
a set of semaphores for X. The terminology stems from the following observation: a 
word is in X if and only if it ends with a semaphore, but none of its proper prefixes 
ends with a semaphore. Thus reading a word from left to right, the first appearance of 
a semaphore gives a “signal” indicating that what has been read up to now is in the 
code X. 


EXAMPLE 5.1 Let A = {a,b} and S = {a}. Then X = A*a \ A*a AŤ whence X = b*a. 
Note that we also have X = A*a* \ A*at A? sho ipg that a semaphore code admits 
several sets of semaphores (see also Proposition y 


EXAMPLE 5.2 For A = {a,b} and S = {aa,ab}, we have A*S = A*aA. Thus A*S \ 
A*SAt = b*aA. 


The following proposition characterizes semaphore codes among prefix codes. 
PROPOSITION 5.2 Let X C At. Then X is a semaphore code if and only if X is prefix 


and 
rer. (2.5.2) 
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Proof. Le =,A*S \ A*SAt be a semaphore code. Then X is prefix and it remains 
to show bee Let w € A*X. Since w € A*S, w has a factor in S. Let w’ be the 
shortest prefix of w which is in A*S. Then w’ is in X. C nsequently. we XA*. 

Conversely, assume that a prefix code X satisfies ( . Set M = XA*. In view of 
Proposition E224 by the fact that X is prefix, we have 


X=M\MA’". 


.2 
Equation EJ implies that 
A*M = Æ XA* c XÆ =M, 
thus, M = A*M and 
X = A*M\ A*MAT. , 
EXAMPLE 5.3 The code Y = {a?, aba, ab”, b} is a maximal prefix code over A. However, 
Y is not a semaphore code. since ab € A*Y but ab ¢ Y A*. 


A semaphore code is maximal prefix, thus right complete. The following proposition 
describes those right complete sets which are semaphore codes. 


PROPOSITION 5.3 Let X C A. Then X is a semaphore code if and only if X is right 
complete and 
XNAAÆXA =O. (2.5.3) 


Proof. şemaphore code is maximal prefix, thus also right complete. Further, in view 
of an 


A*XA* C XAY, 


thus 


nis) 
showing Eq. 3° 3 
Conversely, if ; z X satisfies ZE then X is prefix. To show that X is semaphore. 


XNA XAT CXOXAT =O, 


we verify that ( ‘holds. Let w = ua € A*X with u € A*, x € X. The code X 
being rj t complete. we have uxv = 2’yv for some z’ € X, y € X*, v € A*. Now 
Eq. ( shows that ua is not a proper prefix of x'. Thus ux € x’ A*. n 


COROLLARY 5.4 Let X C At be a semaphore code and let P = XA~. Then PX C 
XPUX?. 


.2 
Proof. (See Fig. PET) Let p€ P, x € X. By Eq. ESD pr = yu for some y € X, 
u € A*.. The code X is prefix, thus |p| < |y|. Consequently, u is suffix of x, and 
by , u ¢ XA. The code X is maximal prefix, therefore u € XAT U X. n 


Formula (25-4) expresses a property of semaphore codes which is stronger than the 
prefix condition: for a semaphore code X, and two elements x, x’ € X, the only possible 
way for x to occur as a factor in x’ is to be a suffix of x’. We now use this fact to 
characterize semaphore codes among maximal prefix codes. 
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X 


.5.4 
Figure 2.21: Proof of Corollary Pat 2_21 


PROPOSITION 5.5 Let X C At, and let P = X A` be the set of proper prefixes of words 
in X. Then X is a semaphore code if and only if X is a maximal prefix code and P is 
suffiz-closed. 


Of course, P is always prefix closed. Thus P is suffix closed if and only if it contains 
the factors of its elements. Bi 
Proof. Let X be a semaphore code. Then X is a maximal prefix code (Proposition ie 
Next, let p = uq € P with u,q € A*. Let v € At be a word such that pu € - shen 
q ¢ XA*, since otherwise pu = uqw € X N A* XAT, violating Proposition oe 
qE XA =P. 

Conversely assume that X is maximal prefix and that X N A*XAt #0. Let z € 
XM A*X At. Then x = uz'v for some u € A*, x’ € X, v € AT. It follows that uz’ € P, 
and, P being suffix-clo, ed, also x’ € P which is impossible. Thus X is a semaphore 
code by Proposition B= . 


E 5.3 
Another consequence of Proposition Ba 1S 


PROPOSITION 5.6 Any semaphore code is thin. 
3 
Proof By Formula (AE no word in XA* is a factor of a word in X. n 


COROLLARY 5.7 Any semaphore code is a maximal code. 


5.1 5.6 
Proof A semaphore ç de 4s a maximal prefix code and thin by Propositions Piia B-at 
Thus by Theorem such a code is maximal code. n 


Now we determine the sets of semaphores giving the same semaphore code. 


PROPOSITION 5.8 Two nonempty subsets S and T of At define the same semaphore 
code if and only if A*SA* = A*TA*. For each semaphore code X, there exists a unique 
minimal set of semaphores, namely T = X \ ATX. 


Proof. Let X = A*S\ A*SAt, Y = A*T\ A* AT. By Proposition [.2, We have X A* = 
A*SA*, Y A* = A*TA*, and by Corollary [.3, X = Y if and only if A*SA* = A*T A*. 

Next, let X = A*S\ A*S.AT be a semaphore ode, By the definition of T = X\ ATX, 
we may apply to T the dual of Proposition Po thus, A*T = A*X. Since A*TA* = 
A* X A* = A*S A*, the set T is indeed a set of semaphores for X. 

Finally, let us verify that T C S. Let t € T; then t = usv for some u,v E€ A*, s €S, 
and s = y't'v' for some u’, vu’ € A*, t € T. Thus, t = uu't'v'v. Note that T C X. Thus, 
formula ESS pplies, showing that v'v = 1. Since T is a suffix code, we have uu’ = 1. 
Thus,t=s € S. 7 
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We now study some operations on semaphore codes. 


PROPOSITION 5.9 If X and Y are semaphore codes, then XY is a semaphore code. 
Conversely, if XY is a semaphore code and if X is a prefix code, then X is a semaphore 
code. 


x y 
.5.9 
Figure 2.22: Proof of Proposition Bo 


-4.2 
Proof. If X,Y qm~aphore codes, then by Corollary ray is a prefix code. Further 
by Proposition en 

A XY CXA*Y C XY AŽ, 


thus XY is a semaphore code. 

Assume now that XY is a semaphore code, and that X is a prefix code. We show 
that A*X C XA*. For this, let w = ux € A*X, with u € A*, x € X, and let y bea 
word in Y of minimal length. Then 


wy = uzy = x'y'u' 


for some 2’ € X, y! € Y, u’ € A* (see Fig. Bra). By the choice of y, we have 
ly| < |y’| < |y’u’|, thus |ua| > |x’|, showing that ua € X A*. n 


The following example shows that if XY is a semaphore code, then Y need not be 
semaphore, even if it is maximal prefix. 


EXAMPLE 5.4 Over A = {a,b}, let X = a*b, and Y = {a?,aba,ab?,b}. Then X is 
a semaphore c ode, and Y is a maximal prefix code. However, Y is not semaphore 
(Example n the other hand the code Z = XY is semaphore. Indeed, Z is 
maximal prea and the set 


P=ZA = ař{1,b, ba, bab} 


5.4 
is suffix-closed. The conclusion follows from Proposition Ea (see Fig. p33). 


COROLLARY 5.10 For any X C At and n > 1, the set X is a semaphore code if and 
only if X” is a semaphore code. 


Proof. If X” is a semaphore co © then X is a prefix by Corollary Saal X is T 
semaphore code by Proposition e converse follows directly from Proposition Be. 
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Figure 2.23: The code a*b{a?, aba, ab”, b}. 2_23 


EXAMPLE 5.5 The code X = {a, baa, baba, bab?,b?} represented in Fig. pax is a maxi- 


mal prefix code but not se ore. Indeed. the word a has an inner occurrence in bab?, 
contradicting formula . However, X decomposes into two semaphores codes 
A=Y 6%, 


with Y = {c, dc, d?, de,e} and Z = {a, ba, b?}. 


Figure 2.24: The code X. 2_24 


Given a semaphore code 
X = AS \ A SA® 


it is natural to consider 
Y = SA* \ AT SA* ; 


The code Y is a maximal suffix code; its reverse Ý = A*Š \ A*SAt+ is a semaphore 
code with semaphores S. The following result shows a strong relation between X and 
Y. 


PROPOSITION 5.11 Let S C At. There exists a bijection B from X = A*S \ A* SAT 
onto Y = SA*\ At SA* such that, for each x € X, B(x) is a conjugate of x. 


Proof First,consider J = A*SA*.Then J is a two-sided ideal and further 
Rad JA Y=J\AJ. 


Indeed, J = gA = (A*S')A*. Thus, A*S and J generate the same right ideal. By 
Proposition i F J JA*t = X. The same arguments hold for Y. 
Now we define, for each x € X, 


D(x) = {d € At | there is some g € A* with x = gd and dg € J}. 
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Thus, D(x) is composed of nonempty suffixes of x. Further D(x) is nonempty since 
x is in D(x). Thus, each D(x) contains some shortest element. This will be used to 
define 8 as follows. For x € X, 


Ble) = ag, (2.5.4) 
where d is the shortest word in D(x) and g is such that 


c= ge (2.5.5) 
Thus, G(x) is a conjugate of x, and B(x) € J. We show that 
Ba)EeI\ATI=AY. 
Assume the contrary. Then 


Ae) = dg = uj (2.5.6) 
for some u € At, jeJ. 

Next g is a proper prefix of x. Consequently, g J, since J = JA* = X A*, and thus, 
any word in J has a prefix in X. This shows that |g| < |j|, since otherwise g would 
belong to the ideal generated by j IS g € J 

It follows from this and from pe that |d| > |u|, thus, d = ud’ for some d' € A*. 
Moreover d' € D(x), since d'(gu) = ju € J and (gu)d’ = gd =x € X. This yields a 
contradiction by the fact that d’ is strictly shorter than d. Thus, G(x) € Y. 

Consider the converse mapping y from Y into X defined by considering, for y in Y, 
the set 

G(y) = {e € At | y = eh and he € J}, 


and by setting y(y) = he, with e ( )got mini l length. 
If y = B(x) = dg is given by ( and 63 nd if y(y) = he with e € G(y), 
eh = y, then 


dg = pl) =eh. (2.5.7) 


Note that gd € J. Thus, d € G(y). Consequently , |d| > Jel, gw the word e is not a 
proper prefix of d. Otherwise, setting d = eu, ug = h in ( with u € At, we get 


geu = gd =z, uge=hee J, 


showing that u € D(x) and contradicting the minimality of |d|. Thus d = e, g = h, 
and y(8(x)) = a. An analogous proof shows that 3(y(y)) = y for y in Y. Thus, 8 and 
y are reciprocal bijections from X onto Y. n 


. . rE „11 $ . 
EXAMPLE 5.6 Let us illustrate the construction of Proposition B-tip by" considering, 
over A = {a,b}, the set of semaphores S = {a?, ba, b?}. Then 


X = A*S \ A*SAt = {a° ba, b°, aba, ab7}, 
Y = SA* \ At SA* = {a?, ab, ba, bab, b? }. 


2.1 , 
Table ET tists on each row an element x € X,the corresponding set D(x) and the 
element G(x) € Y. 
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X D Y 
aa a, aa aa 
aba a, ba, aba aab 
abb b, bb, abb bab 
ba ba ba 
bb b, bb bb 


Table 2.1: The X -Y Correspondence. 


.11 ; 
Proposition Blip shows that any semaphore code can be transformed into a suffix 
code by a bijection which conjugates words. This property does not hold for arbitrary 
prefix codes, as shown by 


EXAMPLE 5.7 Let X = {ab, ba,c, ac, bca}. Assume that there exists a bijection 8 which 
transforms X into a suffix code Y by conjugating the words in X. Then Y necessarily 
contains c, and ab, ba. Further Y contains ca (with c and ac, Y would not be suffix!). 
All the words conjugate to bca now have a suffix equal to one of c, ab, ba, ca. Thus, Y 
is not suffix. 


In fact, X cannot be completed into a semaphore code, since c is a factor of bca. 
We end this section with the following result which shows that bifix codes are not 
usually semaphore codes. 


PROPOSITION 5.12 Let X be a bifix semaphore code. Then X = A” for somen> 1. 


Proof. It is sufficient to show that X C A” for some n. Let x,y € X. For each suffix 
q of x, we have qy € A*X C XA*. Thus there is, in view of Proposition 5.3, a prefix p 
of y such that qp € X. 

In this way we define a mapping from the set of suffixes of X into the set of prefixes 
of y. The set X being suffix, the mapping is injective: it follows that |z| < |yl. 
Interchanging x and y, we get |y| < |x|. Thus, all words in X have the same length. 

| 


For semaphore codes, the above result is a particular case of the theorem of synchro- 
nization of semaphores to he stated in the next section. For bifix codes, it is an easy 
consequence of Theorem 


6 Synchronized codes 


Let X be a prefix code over A. A word w € A* is said to be synchronizing for X if for 
any u,v € A*, we have 
uwv E€ X* > uw € X*. 


Observe that if this holds, then v also is in X* since X* is right unitary. Note that the 
set of synchronizing words is a left ideal. 


EXAMPLE 6.1 The word w = abba is synchronizing for the prefix code X = {ab, ba}. 
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A prefix code X is synchronized if and only i there, exists a word in X* which is 
synchronizing for X. We will see later (Chapter k Í i) a definition of synchronized codes 
for general codes. 

If X is a maximal prefix code, then w is synchronizing for X if and only if 


A*w c X*. (2.6.1) 


Indeed, let w be a synchronizing word. For any u in A*, since X* is right denge, there 
exists a word y a that uwv € X*. Then uw € X*. This shows that BAF holds 
Conversely, if olds, then uw € X* for all u € t ni thus w is synchronizing. 

Observe that if X is a maximal prefix code, then by ( every synchronizing word 
is in X*. 


EXAMPLE 6.2 The code X = b*a is synchronized. Indeed, a is a synchronizing word, 
since A*a C X*. 


EXAMPLE 6.3 A maximal bifix code X over A is never synchronized unless X = A. 
Assume indeed that w € A* is synchronizing. For any u € A* we have uw € X*. The 
monoid X* being left unitary, it follows that u € X*. Thus A* = X*. 


The terminology is derived from the following observation: let w be a word which 
has to be factored into words of some prefix code X. The appearance, in the middle of 
the word w, of some synchronizing word zx i.e., the existence of a factorization 


W = UTV 


implies that ux is in X*. Thus we may start the decoding at the beginning of the word 
v. Since X* is right unitary we have indeed w € X* if and only if v € X*. That means 
that the whole word is in X* if and only if the nal part can be decoded. 

Note that any code X over A satisfying tai : 1s maximal prefix. Indeed, let y, yu € 
X. Then ux € X*, and y(ux), (yu 5 a {wo X-factorizations which are distinct if and 
only if u #1. Thus u = 1. Next, ( shows that X is right complete. 

Any synchronized code is thin. Indeed, if x is a synchronizing word for a code X, 
then z? is not a factor of a word in X, since otherwise uxxv € X for some u,v € A*. 
From uz € X*+, it would follow that X is not prefix. 

The fact that a code X is synchronized is well reflected by the automata recognizing 
X. Let us give a definition. Let 

A = (Q,1,T) 


be a deterministic automaton on A. The rank of a word x € A* in A is defined by 
rank (xz) = Card(Q- x). 

It is an integer or +00. Clearly 
rank ,(urv) < rank 4(zx). 


A word w € A* is a synchronizing in A if rank,(w) = 1. The automaton A is 
synchronized if there exists a word which is synchronizing in A. 
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PROPOSITION 6.1 Let X be a maximal prefix code over A. The following conditions 
are equivalent: 


(i) X is synchronized. 
(ii) The minimal automaton A(X*) is synchronized. 
(iii) There exists a synchronized deterministic automaton A recognizing X*. 


Proof. (i) = (ii). Let x € X* be a synchronizing word for X. Let A(X*) = (Q,i,i). 
Let q € Q, and let u € A* with i- u = q. From ux € X%, it follows that q- 2 = i. Thus 
Q -x = {i}, showing that rank 4(x+)(x) = 1. 

The implication (ii) = (iii) is clear. 

(iii) > (i). Let A = (Q, i, T), let w E€ A* be such that rank 4(w) = 1 and let go = Q-w. 
Since X is right complete, there exists a word v such that gg: uv € T. Set x = wv. We 
verify that x is synchronizing. Let u € A*, and set q = i- u. From q- a € T, we have 
ux E X*. n 


PROPOSITION 6.2 Let X be a thin maximal prefix code over A, and let P = XA”. 
Then X is synchronized if and only if for all p € P, there exists x E€ X* such that 
pr € X*. 


Note that the existence of a unique word synchronizing all prefixes is not required in 
the hypotheses of the proposition. 
Proof. The condition is clearly necessary. It is also sufficient. Let w € A* be such that 
w is not a factor of a word in X. Let K = {pj,po,...,Pn} be the set of suffixes of w 
which are in P. 

The set K is not empty since 1 € K. Define a sequence 21,...,2, of words in X* 
as follows: x; is a word in X* such that pja, E€ X*. If x1,...,a2;_-1, are defined, there 
exist, because X is right complete, words y € X*, p€ P with 


Piti: Zi—1 = YP. 
Then z; is defined to be a word in X* such that px; € X*. It follows that for i =1,...,n, 
ply... L412; E X*. (2.6.2) 
The word z = wx 12%9°-- £n is synchronizing for X. Indeed let u € A*. Then 
uw = yp! 


for some y’ € X* and p' € P. The word p’ is a suffix of w, since o ryise w would be 
a factor of a word in X. Thus, p’ = p; for some p; € K. Using , we obtain 


uz = y (piper +++ £i) (Ti+: En) E X*. 7 
PROPOSITION 6.3 Let X,Y,Z be maximal prefix codes with X = Y o Z. Then X is 
synchronized if and only if Y and Z are synchronized. 
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Proof. Let Y C B*, X,Z C A*, and 8 : B* — A* be such that 
X=Y oZ. 


First, assume that Y and Z are synchronized, and let y € Y*, z € Z* be synchronizing 
words. Then B*y C Y* and A*z C Z*, whence 


A*zB(y) C Z*Bly) = B(B*y) c BY") = X*, 


showing that z3(y) is a synchronizing word for X. Conversely, assume that A*x C X* 
for some x € X*. Then x € Z* and X* C Z*; thus, x is also synchronizing for Z. Next, 
let y = 8-1(x) € Y*. Then 


B(B*y) = Z*z C Ate C X* = BY"). 
The mapping p being injective, it follows that B*y C Y*. Consequently Y is synchro- 


nized. m 


EXAMPLE 6.4 The code X = (A? \ b?) Ub? A? is not syn hrgnized, since it decomposes 
over the code A? which is not synchronized (Example Fh" is also directly clear that 
a word x € X* can never synchronize words of odd length. 


EXAMPLE 6.5 For any maximal prefix code Z and n > 2, the code X = Z” is not 
synchronized. Indeed, such a code has the form x = B” o Z for some alphabet B, and 
B” is synchronized only for n = 1 (Example Es 


We now give a result on a codes hich will be generalized when other techniques 


will be vailable (Theorem . The present proof is elementary. Recall from 
Chapter i that for a finite code X, the order of a letter a is the integer n such that a” 
isin X. 


; ha p ; 
The existence of the order of a results from Proposition litt Note that for a finite 
maximal prefix code, it is an immediate consequence of the inclusion at C X*P, with 
P= XA. 


THEOREM 6.4 Let X C At be a finite maximal prefix code. If the orders of the letters 
a E€ A are relatively prime, then X is synchronized. 


Proof. Let P = XA™ and let A = (P,1,1) be the literal automaton of X*. This 
automaton is complete since X is maximal prefix. Recall that its action is given by 


For all w € A*, set Q(w) = P - w. Then for w, w’ € A*, 
Q(w'w) c Q(w), CardQ(w'w) < Card Q(w’). (2.6.3) 
Observe that for all w € A*, Card(Q(w)) = ranka (w). 
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Let u € A* be a word such that Card(Q(u)) is minimal. The code X being right 
complete, there exists v € A* such that w = uv € Xt. By ( 7 Card(Q(w)) is 
minimal. Further w € X* implies 


1 € Q(w). (2.6.4) 


6.1 
We will show that Card(Q(w)) = 1. This proves the theorem in view of Proposition 3 i 


Let a € A be a fixed letter, and let n be the positive integer such that a” € X. We 
define two sets of integers J and K by 


I={ieN| Q(w)a'nx £9}, 
K = {k € {0,...,n—1} | a"w € X*}. 
First, we show that 
Card I = Card Q(w). (2.6.5) 


Indeed, consider a word p € Q(w)(C P). There is an integer i such that pa’ € X, since 
X is finite and maximal. This integer is unique since otherwise X would not be prefix. 
Thus there is a mapping which associates to each p in Q(w) the integer i such that 
pat € X. This is clearly a surjective mapping onto I. We verify that it is also injective. 
Assume the contrary. Then pa’ € X and p'at € X for p,p' € Q(w), p # p'. This implies 
Card(Q(wa')) < Card(Q(w)), open the minimality of Card(Q(w)). Thus the 


mapping is bijective. This proves ( - Next set 
m=max{it+k|iel,ke K}. 

Clearly m < max I + max K < maxl+n-—1. Let 
R={m,m+1,..., m+n- 1}. 


We shall find a bijection from IJ x K onto R. For this, let r € R and for each p € Q(w), 
let 


v(p) =p-a'w. 
Then 
v(p) = (p-a")-we P: w= Q(w). 
Thus v(Q(w)) c Q(w) and v(Q(w)) = Q(w) by é at th of Q(w). Thus v 


is a bijection from Q(w) onto itself. It follows by ( that there exists a unique 
pr E€ Q(w) such that pra”w E€ X*. Let i, be the unique integer such that ppa” € X. 
Then 7, € I whence ip < m < r. Set 


r= ip +Ant kr, (2.6.6) 
with A € N and 0 < kp < n. This uniquely defines k, and we have 
pa w = (pra'™)(a")*(a*w). 


Since p,a” € X and X* is right unitary, we have (a”)*(a*"w) € X* and also a*-w € X*. 
Thus, k, € K. The preceding construction defines a mapping 


RoIxK, re (ir, kp) (2.6.7) 
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injective. Indeed, if r Æ r’, then either i, Æ ip, or it follows from nd from 
r#r' mod n that k, £ kr. 7 

We now show that the mapping A surjective. Let (i,k) € Ix K, and let A € N 
be such that 


first by determining ip, then by computing k, by means of (7 pase. apping is 
a 


r=itrAnt+kerR. 


By definition of I, there is a (unique) q € Q(w) such that ga’ € X, and by the definition 
of K, we have 
qa’ wex. 


Thus, q = pr, i = ip, k = kr, showing the surjectivity. 


It follows from the bijection that 
n = Card(R) = Card(I) - Card(K). 


.5 
This in turn implies, by baa that Card Q(w) divides the integer n. Thus Card Q(w) 
divides the order of each letter in the alphabet. Since these orders are relatively prime, 
necessarily Card(Q(w)) = 1. The proof is complete. n 


ction4.7 
We will prove later (Section ži the following important theorem. 


THEOREM 6.5 Let X be a semaphore code. Then there exists a synchronized semaphore 
code Z and an integer d such that 


X=Z’. 
Badass 12 
This result admits Proposition ETJ as 7 as a special case. gngider indeed a bifix sema- 
phore code X C At. Then according to Theorem we have with Z 
synchronized. The code X being bifix, Z is 3, also bifix Se ae eee a bifix 
synchronized ode i is trivial by Example us, Z = A and X = Af. 
Theorem escribes in a simple manner the structure of semaphore codes which are 


not synchronized. Indeed, if X = Z° wit, £i synchronous, then X is not synchronized 
if and only if d > 2 (see also N RES 

We may ask whether such a description exists for general maximal prefix codes: is 
it true that an indecomposable maximal prefix code X is either bifix or synchronized? 
Unfortunately, it is not the case, even when X is finite, as shown by 


Q 1 2 3 4 5 6 7 8 9 


Q 
N 
ww 
ote 
w 
Co 
w 
= 


Table 2.2: The transitions of A(X*). 


EXAMPLE 6.6 Let A = {a,b}, and let X be efix code with automaton A(X) = 
(Q,1,1) whose transitions are given in Table ‘The automat (X*) is complete, 
thus X is maximal prefix. In fact, X is finite; it is given in Fig. . 
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Figure 2.26: 


To show that X is not synchronized, observe that the action of the letters a and b 
preserves globally the sets of states 


{1,2,3}, {1,4,5}, {4,6,7}, {1,8,9} 


as shown in Fig. Zi This implies that X is not synchronized. Assume indeed that 
x € X* is a synchronizing word. Then by definition A*x C X*, whence q- x = 1 for all 
states q € Q. Thus for each three element subset J, we would have I- x = {1} 

Further X is not bifix since b°,ab4 € X. Finally, the inspection of re BS shows 
that X is indecomposable. 


We define a canonical decomposition of a prefix code called its maximal decomposi- 
tion. This is used to show in Chapter Rape only maximal prefix codes may produce 
nontrivial groups by composition. 


PROPOSITION 6.6 Let X C At be a prefix code. Let 


D = X*(A*)"! = {w € A* | wA* N X* ZO} 
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be the set of prefixes of X*. 
The set 
U = {u € A* | u™!D = D}. 


is a right unitary submonoid of A*. Let Z be the prefix code generating U. The code 
X decomposes as 
X=YoZ (2.6.8) 


where Y is a maximal prefix code. 


Proof Note first that U C D: Let u € U. Since 1 € D, we have 1 € u~!D, whence 
ué D. 

The set U is clearly a submonoid. Let u,v € U. Then (uv)-!D = v7!u7!D = 
v !D =D showing that uv € U. Assume next that u,uv € U. Then u~!D = D, and 
vtD=v1!u!D = (w)! D = D. Thus U is right unitary. 

We have X* cC Z* =U. Indeed, X* is right unitary. Thus for all 2 € X*, 271X* = 
X*. It follows that 


at D = oA) 


We verify that for u € U, there exists v € U such that uv € X*. 

Indeed, let u € U. Then u € D, and therefore uv € X* for some v € A*. Since 
X* C U, we have u,uv € U, and consequently v € U (U is right unitary). The claim 
shows that X deco gges over Z. Let Y be such that X = Y o Z. Then Y is prefix 
by Proposition pe [he claim also shows that Y is right-complete, hence Y is prefix 
maximal. n 


It can be shown (Exercise Boy that for any other decomposition X = Y’ o Z’ with Z’ 
prefix and Y’ maximal prefix, we have Z™* C Z*. This justifies the qme of maximal 
decomposition of the prefix code X given to the decomposition rats 

In the case where X is ximal prefix code, the set D defined above is A*. Thus 
U = A* and Z = A in ( Thus the maximal decomposition, in this case, is trivial. 


7 Average length 


hapter1 
The results of Chapter ff concerning the measure of a code apply of course to prefix 
codes. However, for these codes, considerable extensions exist in two directions. First, 
the properties proved in Chapter [f hold for measures which are much more general than 
those defined by Bernoulli distributions. Second, there exists a remarkable combinato- 
rial interpretation of the avera jength of a prefix code by means of the measure of its 
proper prefixes (Proposition Pao 
A function 
m : A* 30, 1] 
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is a (positive) cylindric measure if the two following conditions are satisfied: 


mL) = (2.7.1) 
Ti) = 5 n(wa). (2.7.2) 


acA 
The function 7 is extended to subsets of A* by setting, for L C A*, 


m(L) =$ alu): 


uel 


12 
pe P nay be written as 7(w) = m(wA). Note that this implies, in view of 


, that 


and for all w € A*, 


m(A) =1. 


More generally, for n > 0 
1(A”) =1, 


since 


n(A") = X` a(wA) = X x(w) = 2A"). 


we An weAnr 


A positivi ern ulli distribution is of course a cylindric measure. The converse is false. 
Example {7.1 shows that cylindric measures are quite general. 


EXAMPLE 7.1 Let A = {a,b}, and let (an)n>0 be a sequence of positive real numbers 


such that the series 3, Qn converges and 
n>0 


X on=a<1. (2.7.3) 


n>0 


Define a sequence (Bn)n>0 by 89 = 1 and for n > 1, 


n—-1 
Bn=1- Y a4. 
i=0 
Then by construction 
0< G1. (n>1), Qan + Bn41 = Ên (n20). 
We define a cylindric measure on A* by setting for n > 0, u € AF, 


n(ba)=an, T) = Bn Tbau) = an2, 


It is easily verified that a is indeed a cylindric measure. Let X =,b*a. The real 
numbers an, Ên are reported on the literal representation of X (Fig. . Note that 
wx) = DS Oy = Oh. 
n>0 
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Figure 2.27: Some values of the cylindric measure. 2_27 


EXAMPLE 7.2 Let a : A* —]0,1] be a cylindric measure, and let u € A*. Define a 
function p : A* —]0, 1] by 


p(w) = , weAl*, 


Then p is a cylindric measure since 7(1) = 1 and 


Yele) = i a(utsa) = E — p(w), 


acA 


PROPOSITION 7.1 Let a be a cylindric measure on A%*. 
1. For any prefix code X, we have 


m(X) <1. 
2. For any finite maximal prefix code X, we have 
a(x j=1. 


2 110 
For Bernoulli distributions, this follows from Theorems [ipa ana erf and from the 
fact that a finite set is thin. 


Proof. We first show 2. If Card(A) = 1, or X = A, then the result is clear. Otherwise, 
let x € X be a word of maximal length, and let t € A*, a € A be such that x = ta. 
Then by the maximality of x, tA C X. Ift = 1, then X, ze Thus, t 4 1. The set 
Y = (X \tA) Ut is a maximal prefix code (Proposition . Next, Y has strictly fewer 
elements than X; this comes from Card(A) > 2. Further 


n(Y) = n(X) — r(tA) + r(t) = 7(X). 


This shows that 7(X) = 1 by induction of the cardinality. 
To prove statement 1, let X be an arbitrary prefix code and let 


Xn = {xE X | |z| <n}. 
According to what we just proved, we have 7(Xn) < 1. Thus, 


m(X) =sup7(X,) <1. n 


n>0 
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a < 1. This shows that the second part of Proposition ‘doés not hold for infinite 
BS addition, since any number a €]0,1[ can be written in the form given by 
, the cylindric measure of the infinite maximal prefix code X can take any value 
in the interval ]0, 1]. 
Given a prefix code X C A* and a cylindric measure m on A* such that 7(X) = 1, 
the average length of X (relatively to 7) is defined by 


MX) = S° hhe): 


rex 


stall 
EXAMPLE fF f (continued) The code X = b*a is maximal Bep Nevertheless, 7(X) = 


It is a nonnegative real number or infinite. The context always indicates which is the 
underlying cylindric measure. We therefore omit the reference to it in the notation. 

The quantity A(X) is in fact the mean of the random variable assigning to each z € X 
its length |z]. 


PROPOSITION 7.2 Let X be a prefix code and let P = XA`. Further, letm be a 
cylindric measure such that 7(X) = 1. Then A(X) =7(P). 


Proof. We start the proof by showing that each p € P satisfies 

n(pA*ž 1X) =1(p). (2.7.4) 
Consider indeed the function p : A* —]0, 1] defined by 

p(w) = m(pw)/m(p) . 
In view of Example EET. a cylindric measure. On the other hand 

pA* N X =p(p™ X), 


-4.6 ee 
and by Proposition Eate set p™tX is a prefix code. Proposition if ttirefore implies 
that o(p 71X) < 1. From 


p(p™ +X) = n(pA* N X)/z(p) 


it follows that 
n(p) > a(pA* N X). (2.7.5) 


Next let 
Y = (X \ pA* N X)) Up. 


7.1 


-4.6 
By the same Proposition rot thre set Y is a prefix code; according to Proposition 2 i 


mY) <1. Thus, 
m(X) —m(pA*1X)+7(p) <1. 


By assumption, 7(X) = 1. Consequently 
n(p) < m(pA* N X). 

-5 4 

Together with (Br this yields t= 
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Now, we observe the following identity concerning formal power series 


X |z = So (p40 X). (2.7.6) 


rex pEP 
.6 
Indeed, the multiplicity of a word x € X on the right-hand side of ( S umber 
of its prefixes in P. This number is precisely its length. It follows from ( that 
X` |zja(z) = X` (pat X). 
rEX pEP 


4 
The left-hand side is A(X), and by Eii we obtain 


pEP 


For a finite maximal prefix code, the cylindric measure equals 1. Thus, 


COROLLARY 7.3 Let X be a finite maximal prefix code and P = X A`. For any cylin- 
dric measure t on A*, the following holds: 


A(X) = 7(P). = 


For a Bernoulli distribution, the finiteness condition can be replaced by the condition 
to be thin. 


COROLLARY 7.4 Let X be a thin maximal prefix code, and P = X A`. For any positive 
Bernoulli distribution n on A*, we have 


A(X) =P) 


Further, the average length A(X) is finite. 


sabi: 4 
O ZÀ 


Ic 110 
Proof. he code X being maximal, Theorem [6.12 shows that ™(X) = 1; thus, the 


equality A(X) = 2(P) follows from Proposition [7-3 Moreover, P is thi since, each 
factor of a word in P is also a factor of a word in X. By Proposition [I TUP) is 
finite. n 


apter6 i à sua Sk 
We shall see in Chapter fx{ that the average length is still finite in the more general 


case of thin maximal codes. 


EXAMPLE 7.3 Let A = {a,b} and X = a*b. For any positive Bernoulli distribution m 
, let z(a) = p, 7(b) = q. Then 


A(X) =a(a")=1/e. 
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5 
EXAMPLE 7.4 Let D be the Dyck code over A = {a,b} (see Example iby We have 


seen that for a Bernoulli distribution given by m(a) = p, 7(b) = q, the following formula 
holds : 

™(D)=1—|p—4|. 
Further, setting dn = 7(DNM A”), we have by formula firs 


2 f2n—-—2 
dən = 2( ora don41 = 0 (#21). 
n\n-1 


It follows that, for p = q = 1/2, we have 7(D) = 1 and 
2n — 2 
A(D) = Y 2nd, = 3a 4 P) 
n>0 n>0 1 


This series diverges since, according to Stirling’s formula, 


22n (7 — ’) 2 
2 ~ ——. 
n-1 \/m(n — 1) 
Thus, the average length of D is infinite for p = q = 1/2. 


The next example concerns the computation of the average length of semaphore 
codes. We start with an interesting identity. 


PROPOSITION 7.5 Let X C At be a semaphore code, P = XA™~ and let S be the 
minimal set for which X = A*S \ A*SA*. For s,t € S, let 


X,= XN A*s Rest = {w € A* | sw € A*t and |u| < |t|}. 


Then, for allt € S, 


Dia Aia (2.7.7) 


ses 


Proof. First, we observe that each product X. Rs, t, is unambiguous, since X, is prefix. 
Further any two terms of the sum are disjoint, since X = |J X, is prefix. Thus, it 
suffices to show that 


Pr= |] Aah. 
ses 


Figure 2.28: Factorizations of pt. 2_28 


First let p € P, and let x be the shortest prefix of pt, which is in A*S. Then x € X 
and 
pt = rw 
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for some w € A*. Next x € X, for some . Set x = us. The word p being in P we 
have |p| < |x|, whence |w| < |t| (see Fig bes Now p cannot be a proper prefix of u, 
since otherwise s would be a proper factor of t, contradicting Proposition oe the 
minimality of S. Thus, u is a prefix of p and sw € A*t , showing that w € Rs.. 
Conversely, let x E€ Xs and w € RR,» for some s,t € S. Then x = us and sw = It 
for a proper prefix J of s. Then ul is a proper prefix of us = x; thus, ul € P and 
xw = ult € Pt. n 


7.5 
COROLLARY 7.6 With the notations of Proposition at we have for any Bernoulli 
distribution T, the following system of equations: 


MX)m(t) = So a(Xs)r(Rsz), (LE $), (2.7.8) 
sES 

Xo a(X:)=1. au 

sES 


For the case of a finite set S, the system (ras a set of 1+Card(S) linear equations 
in the 1 + Card(S) unknown variablest(X,) and A(X). The sets Rs; being finite, the 
coefficients 7(R, 4) are easily computed. This yields the desired expression of A(X). In 
the special case where S' is a singleton, we get 


COROLLARY 7.7 Let s € At, let X = A*s \ A*sAt and R = {w € A* | swe 
A*s and |w| < |s|}. Then for any positive Bernoulli distribution 7, we have 


A(X) = 1(R)/n(s). m 


EXAMPLE 7.5 Let A = {a,b} and consider s = aba. The corresponding set R is 
R = {1,ba}. If t(a) = p and 7(b) = q = 1 — p, then for X = A*aba \ A*abaA*, we get 


1+ pq 
MX) = 


Now, choose s’ = baa. The corresponding R’ is the set R’ = {1}. Thus, for X’ = 
A*baa \ A*baaA*, we have 


For p = q = 1/2, this gives A(X) = 10, A(X’) = 8. This is an interesting paradox: we 
have to wait longer for the first appearance of aba than for the first appearance of baa ! 


8 Length distributions 


Let X be a prefix code on the alphabet A with k letters. Let u(z) = J n>o Unz?” with 


Un = Card(X 7A"). We refer to the series u as the length distribution of X. ETE 
By Theorem p.9, one has u(1/k) = J n>0 Unk” < 1. Conversely, by Theorem Bor 1f 


u(1/k) = 1, there exists a prefix code X such that un =Card{ X N 4”). 

If X is a thin maximal prefix code, then by Theorem [6-12, u(1/k) = 1. Conversely, 
if u(1/k) = 1, then there exists prefix code X such that un = Card(X N A”) which is 
clearly a maximal code, hence a maximal prefix code. 
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ef OX 
If X is regular, then the series u(z) is N-rational by Proposition fit The following 


statements proves the converse. 


THEOREM 8.1 A series u(z) = 0,39 Unz” is the length distribution of a recognizable 
prefix code on k letters if and only if it is N-rational and satisfies the inequality u(1/k) < 
1. 


The conditions are obviously necessary. To prove that they are sufficient, we prove 
several intermediary results. We assume from now on that u is an N-rational series and 
that u(1/k) < 1. We may suppose that uo = 0. The following lemma is the first step 
of the proof. 


LEMMA 8.2 If A = (Q,i,t) is a normalized weighted automaton recognizing u, the 
adjacency matrix of A has a k-approximate eigenvector w which is positive and such 
that Wi = W. 


Proof. Let A = (Q,i,t) be a normalized weighted automaton recognizing u. Let A 
be the weighted automaton on the set of states Q = Q — {t} obtained by merging i 


and t. Let M be the adjacency matrix of A. Since A is trim, M is irreducible. By 
Proposition He CAHD, recognizes u*(z) =1/(1- u(z)). Since uf lik) < A, the radius 
of convergence p of u* satisfies p > 1/k. By Proposition bes the spectral radius A 
of M is 1/p. Thus à > k and by Proposition Poy ines is a positive k-approximate 
eigenvector w of M. It is then easy to verify that the Q-vector w defined by wg = Wy 
for every q Æ t and w; = Ñ; is also a positive k-approximate eigenvector of M. Indeed, 
since Mp, = Mp, for all p, the inequality X` MyqWq < kūp gives > My qwq < kwp as 
required. n 


Observe that in the case where u(1/k) = 1, the k-approximate eigenvector w obtained 
in the proof is in fact a positive k-eigenvector instead of an approximate one. 

We will use the following two combinatorial Jgmmas of some independent interest. 
These will be used in he yoot of Lemma SR vc that if at the present step of 
the proof of Theorem we already have w; = w; = 1, we can directly skip to the end 
of the proof. 

The first combinatorial lemma is a variant of the pigeon-hole principle. 


LEMMA 8.3 For any sequence £1,%2,...,2m of m positive integers, there is a non- 
empty set S C {1,2,...,m} such that >) eg £q is divisible by m. 


Proof. The partial sums 71,271 + £2, £1 + £2 + £3,..., £1 + G2 +--+ + £m either are 
all distinct (mod m), or two are congruent (mod m). In the former case, at least one 
partial sum must be congruent to 0 (mod m). In the latter, there arel <p<r<m 
such that 

£1 + T2 +-+: + Tp = T1 + £2 +- + £r mod m). 


Hence £p41 + 2%py2 +++: + Er = 0 (mod m). . 


The second lemma deals with Q-vectors. For a Q-vector x = (%q)geq, we denote 
d(x) = J qeq T4 and for two Q-vectors x = (Xq)qeq and y = (Xq)qgeq, we denote 
rey pe) TqYq- 
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LEMMA 8.4 Let w € N® and set m = w; for some fixed t€ Q. Let t be the character- 
istic Q-vector of t. For any y € NÌ, there exist n+ 1 vectors vo, V1,--.,Un E N@ such 
that y = Xj- vj, with 

(i) d(uj) < m and vj -t € {0,1} forO<j <n, and 
(ii) uj-w=0 mod m forl<j <n. 


Proof. We first write y’ = y — yt. If the result holds for y’, then it holds for y since t 
satisfies conditions (i) and (ii). Thus we may assume that y; = 0. 

We proceed by induction on d(y). If d(y) < m, then the properties hold with n = 0 
and vo = y. Otherwise, we write y = v + y' with d(v) =m. Let £1, £2,...,£m be the veantaaa 
sequence formed by the numbers wy with each wq repeated vg times. By Lemma <n 
there is a set S C {1,2,...,m} such that ` eg zq is divisible by m. Let v’ be the 
Q-vector where vi, is the number of times wg is selected in the set S. Then v’ 4 0 
and v -w = 0 modm. We write v = v’ +s with s Æ 0. Thus y = v’ + y” with 
y" =s +y'. Since d(y”) < d(y), we can apply the induction hypothesis, obtaining the 
desired result. n 


LEMMA 8.5 There exists a normalized weighted automaton A = (Q,i,t) recognizing 


u such that the adjacency matrix of A has a positive k-approximate eigenvector w 
satisfying wi = w = 1. 


Proof. We start with a trim normalized weighted tomaton i = (Q,i,t) recognizing u. 
Let M be the adjacency matrix of A. By Lemma FE a positive k-approximate 
eigenvector w of M such that w; = w+. We are going to define a new weighted automa- 
ton A’ = (R, 7’, t’) recognizing u with adjacency matrix N with the property that there 
is a nonnegative R x Q-matrix T such that 


TM = NT. 


We now construct the set R as a set of vectors in N® as follows. Initially, R is composed 
only of the characteristic vector i of i. dens 

For each nonnegative vector r in R, we apply Lemma Balto y= rM with w the 
k-approximate eigenvector of M and m = w; = w. This gives a decomposition rM = 
0 vj. We add vp,...,Un to the set R. For s € R, let N, s be the number of indices 
j (0 < j < n) such that vj = s. By definition 


rM = ip Nyp,38 « 
sER 


Repeat the construction until no new element is added to the set R. The construction 
stops because the sum of the entries of an element of R is bounded by m. 

This defines the matrix N. The matrix T is defined by the condition that its row of 
index r is the vector r itself. Observe that by construction TM = NT, since the row 
of index r in TM igo M, and the row of index r in NT is ee Nr s5. 

By Lemma 8-4) we have r; = 0 for all r € R, except r = t, where we recall that t is 
the characteristic vector of t. We define A’ = (R, i’, t’) by setting i’ = i, t = t. Let i’ 
(resp. t’) be the characteristic R-vector of the state i’ (resp. of the state t’). We obtain 
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iT =i and Th t’. Indeed, for each r € R, the component Tt, is equal to r-t. By 
Lemma emagan. 0 except for r = t. 
Since TM” = N"T for all n > 1, we have 


un =iM”t = i TM”t =i N"Tt = ï N"t. 


This shows that u is recognized by A’. We also have NTw = TMw < kTw and thus 
w’ = Tw is a k-approximate eigenvector of N. For each r € R, we have 


/ / 
`> Nr sws < kw, . 
sER 


eComb 
Since w, = (Tw), = s - w, by Lemma ET we have N, sw, =0 mod m for all s except 
possibly for one fixed index sọ. We rewrite the inequality as 


> N, sw, + Nr soWs, < kup: 
s#so 


Dividing by m both sides gives 


5 N, sw, /m + Nr sows /M < kw, /m. 
s#so 


Taking the ceiling of both sides gives 


[D2 Nests /m + Nnsowso/m]| < [hw /m] 
s#so 


Since on the left hand side, all terms are integers except possibly the last one, one has 


po Ny sw, /m + Nnsowla/m| = > N, sw, /m + [Nr sow /m] . 


sŻso s#Żso 
This shows that the vector w” defined by w” = [wi./m] is a positive k-approximate 
eigenvector such that wi, = wi = 1. : 
-SIAM , , 
Proof of Theorem e first show that there exists a normalized weighted automaton 


A = (Q,i,t) recognizing u such that each state has at most k outgoing edges. 

We start with a normalized weighted automaton A = (Q, i,t) recognizing u with state 
set Q such that the adjacency matrix M of G has a positive k-approximate eigenvector 
w with w; = w = 1. We are going to define a weighted automaton A’ = (R, 7’, t’) with 
adjacency matrix N with the property that there is a nonnegative matrix T such that 


MT =TN. 


(note that the transposition is the opposite of that used in Lemma Bea). BY Construc- 
tion, the sum of each row of the matrix N will be at most k. 

The set R contains wq copies of each state q in Q. Note that, since w; = 1, the set 
R contains only one copy of the initial state i. Formally, R is the set of pairs (q, j) for 
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q E Qand 1 < j < wy. For a given p E Q, we define Nii) (q,3) for 1 < i < wp and 
1 < j < w in the following way. 

Let X = {(q,jm) | q E Q,1 < j < wọ, 1 < M < Mpg}. The set X has by 
definition Mp: w elements. Since M,-w < kwp, we may partition the set X into wp sets 
X1,..-,Xw, having each at most k elements. We denote by vg, q,j the number of elements 
of the set Xy of the form (q, j, m) for some m. We then define M(p.¢) (4,7) = Veq,j- Thus, 
the number of edges going out of each state (p, £) is the cardinality of X;, and thus at 
most k. 

Define the Q x R-matrix T by Tj (¢,;) = 1 for 1 < j < wq, the other components pene 
0. Then we have MT = TN. Indeed, MTy (4,7) = Mp,q and TNp (4,3) = oe Nio, 0),la,5) 
Mp q. Let A’ = (R,i’,t’) be the weighted automaton with B matrix N ona 
with i’ = (i,1) and t = (t,1). Then A’ recognizes u. By construction the sum on each 
row of N is at most k and thus A’ satisfies the required property. 

We now may label the edges going out of each state with different letters. Since there 
is only one initial state and no edge going out of the terminal state, the automaton 
obtained recognizes a prefix code with length distribution u. n 


EXAMPLE 8.1 Let u(z) = 32? /(1— 22). We have u(1/2) = 1. The series u is recognized 
by the trim normalized weighted automaton of the left aot Figure spate’ result of 
the transformation realized in the proof of Lemma Pemaes apet ented on the right. The 
coordinates of the 2-eigenvector in both cases is indicated in a square. 


2 3 1 2 1 


3 
3 3 
1 2 4 > 1000 0300 0001 —> 
or 


Figure 2.29: A trim normalized weighted automaton of u and the first transformation 


S 
The matrices M, N and T of the proof of Lemma Ba are = 


0 3 0 il p 1 0 sl 1 0 0 ol 

0 0 1 1 00 1 3 0300 
u 0 1 0 0 ae 010 0]/’" 0 0 3 0° 

0 0 0 0 0 0 0 0 000 1 


The second transformation (proof of the theorem) gives the weighted automaton of 
Figure e left. Note that the state with weight 2 is a split in two states (2, 1) 
and (2,2) and that its output is distributed amongst them. The matrices M, N and T 
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Figure 2.30: The second transformation and the final result 


of the proof are 


0100 oh a 10000 
0013 TAE 01100 
P ea a 
0000 CPIE 00001 


A deterministic labeling gives the automaton represented on the right. It recognizes 
the regular prefix code on X = (b?)*{aa,ab,ba}. A final minimization would merge 1 
and 4. 


EXAMPLE 8.2 Let s be the series defined by 


z? z? 


“O° aA A 


Since s(1/2) = 1, it satisfies the Kraft equality for k = 2. The series s is recognized 
by the weighted automaton A = (Q,1,4) given in Fj ure BZ The adjacency matrix of 
= where th 


A admits the 2-eigenvector represented on Figure e coefficients of w are 
represented in squares beside the states. Thus m = 3. 


fig: christophe 


Figure 2.31: A normalized weighted automaton of s 
; : eSuper 
The result of the transformation performed in the proof of Lemma Bal is given 
bs mm this f 


in Figure i is figure, each vector of N@ is represented by a sequence of 
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states with repetitions corresponding to the multiplicity. For example, the Q-vector 
u = (0,0,1,0,0,2,0) is represented by (3,6,6). The coefficients of the eigenvector are 
represented in squares beside the states. 


Figure 2.32: The first transformation. fig:superchristoph 


A regular p efix code X having s as length distribution, is given by its automaton 
in Figure peepee figure, the nodes have been renumbered, with the children of a 


node with a given label represente „only once. The State 0 is final. The automaton is 
obtained from the graph of Figure ae by application of the second construction (proof 
of the Theorem). For example, the state (2,5), which has coefficient 2, is split into two 
states named 2 and 3 in the tree. 


Figure 2.33: A regular prefix code with length distribution s 
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9 Optimal prefix codes 


Let X be a code over some alphabet A, and assume that each letter a € A has a cost 
c(a) associated with it. The cost of a word w is by definition the sum of the costs of 
its letters. 

Assume next that each code word x € X has a weight p(x) associated with it. The 


cost of X is 
Cx = >P p(x)c(x) . 
xrEX 


The prefix coding problem is to find a prefix code X with minimal cost, for given weights. 
In the sequel, weights and costs are positive numbers. 

As usual, the code X can be viewed through a coding morphism, that is a bijection 
y: B — X for some alphabet B which extends into an injective morphism from B* 
into A*. With this in mind, the weight of a word x € C is in fact the weight of the 
letter b € B such that x = y(b). So the cost of X is also 


Cx =o pldje(v()) 


beB 


In the case where all letters a € A have equal cost, the cost of a word over A is merely 
its length. In this case, the prefix coding problem reduces to the construction of a 
prefix code which minimizes 
Cx = >> p(a)|al. 
cEx 

A solution to this problem is called a Huffman encoding. The following greedy algorithm 
computes a solution in the binary case in time O(nlogn). Let A = {a,b}, and p : B > 
R be the weight function. 

If B has just one element c, set y(c) = 1; otherwise, let c1, cz be two elements of B 
such that p(ci), p(c2) < p(c) for all c € B— {c1, c2}. Let 


B'=B- {c1,c2}U {d}, 


with d ¢ B and define p’ : B’ > R, by p’(c) = p(c) for all c € B’ — {d} and p'(d) = 
p(ci) + plea). 

Let y! be a Huffman encoding of (B’,p’) and let y be the function from B into A* 
defined by 


yc) =Y (c) force B\ {e1,e2}, a)=V (da, y(c2) = V (db. 


Let us show that y is a Huffman encoding of (B, p). 

We show that there is an optimal encoding y such that y(c1), y(c2) are words of 
maximal length differing only by the last letter. This proves the desired result. 

Let us consider a prefix code X = y(B) such that Cx is minimal. Let c1,c2 € C be 
such that 7(c1), 7(c2) are the smallest values. Let x,y E€ X be two words of maximal 
length such that z = pa, y = pb. Let c,d € C be such that y(c) = x, y(d) = y. Consider 
the encoding y’ obtained from y by exchanging the values of c,,c2 with the values of 
c,d. With X’ = y'(B), one gets Cx < Cx and thus Cy, = Cx. 
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In the case $`, p(x) = 1, the number Cx is just the average length of the words of X. 


Less is known in the case where the letters used for the encoding have unequal cost. 
The problem is motivated by coding morphisms where different characters may have 
different transmission times. One example is the telegraph channel, in which the dash 
“—” has twice the cost of a dot “-”. Another example is the family of run-length limited 
codes, where two consecutive symbols 1 must be separated by at least a and at most 
b adjacent 0’s. In this model, each word 01 with a < k < b may replaced by some 
symbol in a new alphabet, and the cost of this symbol is cost k + 1. 


The prefix coding problem with unequal letter cost has been considered mainly in 
the case where the costs are integers. 


A special case of the prefix coding problem with unequal letter cost is known as the 
Varn coding problem. This is the prefix coding problem when all the weights of the 
code words are equal. This problem has an amazingly simple O(n log n) time solution. 


Assume that all n code words have equal weight equal to 1. An optimal code mini- 
mizes the cost 


Cx = oe); 


rex 


where the cost c(x) is the sum of the costs of its letters, that is 


ea) =>" e(a)|alq. 


acA 


To construct an optimal code over a k letter alphabet A, assume first that n — 1 is a 
multiple of k — 1, so that is n = q(k — 1) + 1 for some integer q. So the prefix code 
(or the tree) obtained is complete with q internal nodes and n leaves. We use the 
algorithm which starts with a tree composed solely of its root, iteratively replaces the 
leaf of minimal cost by an internal node which has k leaves, one for each letter. The 
number of leaves increases by k — 1, so in q steps, one gets a tree with n leaves. 


EXAMPLE 9.1 Assume we are looking for a code with seven words over the ternary 
alphabet {a,b,c}, and that the cost for letter a is 2, for letter b is 4, and for letter c 
is 5. The algorithms starts with a tree composed of a single leaf, and then build the, 
tree the a algorithm. There are two solutions, both of cost 45, giv n fi ure jejem 
Troe PAR ctnes the prefix code {aa, ab, ac, ba, bb, bc, c}, and oe PENS ives the 
code {aaa, aab, aac, ab, ac, b, c}. 

In order to get complexity O(n log n) for the construction, the leaves of the tree are 
mananged trough a priority queue: insertion of a leaf is done in O(logn) operations, 


and the same time complexity holds for retrieval of a leaf with minimal cost. 
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0 
0 0 0 2 4 
2| [4] [5 2) [4] Tal] T6 7| J6 8 
4| T6 7 
(a) (b) (c) (d) First solution 
0 
2) [4 5 
4) [6 7 
6 8 9 


(e) Second solution 


Figure 2.34: Varn’s algorithm for 7 words. At each step, a leaf of minimal cost is 
replaced by a node with all possible leafs. There are two choices for the last step. Both 


give a minimal tree. 


VARNCODING() 
T — root 
> By definition, the cost of the root is 0 
Q — PRIORITYQUEUE() 
ADD(Q, root) 
while the number of leaves is 4 n do 
f — ExTRACTMIN(Q) 
for each a € A do 
c — MAKECHILD( f) 
cost(c) — cost( f) + cost(a) 
ADD(Q, c) 


return T 


= = 
FOOONDOKR WN FH 


If the size n of the code is not of the form q(k — 1) +1, one constructs a code with m 
elements for the least m > n with m = 1 mod k — 1, and then deletes the leafs of 
greatest cost. 


EXAMPLE 9.2 Assume that in the previous example, a code with only 6 elements is 
looked for. Then one removes the word of cost 9, that is the word cc in the first case, 
and the word aac in the second case. 
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2.9.1 
For the proof, see problem 2 ji 


A special case of prefix coding is coding that is compatible with a given ordering of 
the input alphabet. 

Consider a coding morphism y : B* — A*, where A and B are alphabets equipped 
with an order. Then y is an ordered coding if 


b < b => q(b) < yb") 


where the order in A* is the lexicographic order induced by the order on A*. If y is 
a prefix coding, and if the prefix code X = y(B) is viewed as a tree, this means that 
the leaves of the tree, read from left to right, correspond to the encoding of the input 
letters in B, read in alphabetic order. Such a tree is called ordered tree. The ordered 
prefix code problem is to find a ordered coding that with minimal cost 


Cx = > p(b)\r(b)| 


beB 


where p(b) is the weight of b. 


EXAMPLE 9.3 Consider the alphabet B = {a,b,c}, with weights p(a) = p(c) = 1 
and p(b) = chain igure a) shows an optimum tree for these weights, and fig- 
ure an Optimum ordered tree. This example shows that Huffman”s algorith 
does not give the optimal ordered tree. 


(2) [4 1 
b c 
1 1 1 4 
a č a b 
(a) Optimum coding (b) Optimum ordered coding 


Figure 2.35: Two trees for the given weights. Tree (a) has cost 8, it is optimal but not 
ordered. Tree (b) is ordered and has cost 11. 


B, EQ Ap egusider the sequence of weights (4,3,3,4). A optimal tree is given in 


Figure shows that in an optimal ordered tree, leafs with minimal weight need 
not to be adjacent. 


Let B = {b,,...,b,} be an ordered alphabet with n letters, and let p; be the weight 
of letter b;. We present an algorithm for computing an optimal ordered tree, and due 
to Garsia and Wachs (see Notes). The algorithm is composed of two parts. In the first 
part, called the combination part, one starts with the sequence of weights 


p = (P1; ---; Pn) 
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3 3 4 
b c d 


Figure 2.36: The minimal ordered tree for weights (4, 3,3, 4). 


and constructs an optimal binary tree T” for a permutation bo(1),.--, bo(n) of the al- 
phabet. The leafs, from left to right, have weights 


Po(1)» +-+: Po(n)- 


In neral, this permutation is not the identity, so the tree is not ordered, see Fig- 
are BEER second part, called the recombination part, one constructs a tree T 


25 22 20 14 25 20 22 14 
a b e a b e 
12 10 12 10 
c d c d 
(a) After combination (b) After recombination 


Figure 2.37: The two steps of the algorithm: (a) the unordered tree obtained in the 


combination phase, and (b) the final ordered tree, obtained by recombination. 


which has the weights p1,...,Pn associated to its leafs from left to right, and where 
each leaf with weight p;,appears at the same level as in the previous tree T”. This tree 
is ordered (see Figure Bee 

Saying that each the leafs have the same level in T and in T’ means that the code 
words 7(b;) and 7/(b;) defined by the trees have the same length, and therefore the 
trees T and T’ have the same cost. Thus T is an optimal ordered tree. 

We describe first the construction of the tree T”. 

The algorithm operates on a sequence of weights p = (p1,...,Pn) of weighted trees 
ty,...,tn. The weight of each tree is the sum of the weights of its leafs. 

One starts with the sequence p = (pj,..-,Pn) of weights of the letters (b1,...,bn), 
where each tree t; is just a leaf labelled with b; and with weight p;. At each step, two 
well chosen adjacent trees t,_1,t, of the current sequence are combined into a new tree 
t’ with left subtree t,_; and right subtree tg and with weight the sum pk—1 + pk of the 
weights of these two trees. The two trees t,_1,¢, are removed from the sequence of trees 
and the new tree t’ is inserted in the sequence at an appropriate position. Similarly, 


Version 22 janvier 2008 J. Berstel, D. Perrin and C. Reutenauer 


9. Optimal prefix codes 151 


the weights py_; and pz are removed from the sequence p, and the weight pz_1 + pk is 
inserted into the sequence. 

After n — 1 steps, one gets a single tree which is the tree T’ looked for. To simplify 
the presentation, it is convenient to introduce two virtual trees to and tn+1, both with 


weight +00. 
For the description of the selection procedure, it is convenient to introduce some 
terminology. A sequence (pj,...,px) of numbers is 2-descending if pi > pi+2 for 1 < 


i < k— 2. In othere terms, one has pı > p3 > ps >+- and po > p4 > pe>-::- 


EXAMPLE 9.5 The sequence 14, 15,10, 11, 6,8, 4 is 2-descending, since 14 > 10 > 6 > 4 
and 15> 11> 8. 


Clearly a sequence (pi,...,p%) is 2-descending if and only if the sequence of “two-sums” 
(pi + po,---,Pk-1 + pk) is strictly decreasing. 

We now introduce the definition of left minimal pair, of target and of scope for ease 
of description of the algorithm and of its proof. 

Let p = (pi,.--,Pn) be a sequence of (positive) weights. The left minimal pair of p is 
the pair (pp_1, px), where (pi1,..., px) is the longest 2-descending chain that is a prefix 
of p. The index k is the position of the pair. In other words, k is the integer such that 


Pi-1 > pe (1<i<k) and Pk-1 Speen. 


EXAMPLE 9.6 For (14,15, 10,11,12, 6,8,4), the minimal pair is (10,11), and its posi- 
tion is 4, whereas for the sequence (28, 8, 15, 7, 20,5), the minimal pair is (15,7) and its 
position is 4. 


Observe that the left minimal pair can be defined equivalently by the conditions 


Pi-1 + Pi > pi +pi+1ı (<ti<k) and Pk-1 +Pk < Pk + Pky. 


This writing shows more clearly that one looks for minimal pairs of consecutive weights. 
Given a sequence p = (pj,...,Pn) and the position k of its minimal pair (pk—1, Pk) 
the target is the index j with 1 < j < k such that 


DAS Pk-1 + Pk > Pj,- -Pk 


EXAMPLE 9.7 For (14, 15, 10, 11, 12, 6,8, 4), and the minimal pair (10,11), the target is 
1, whereas for the sequence (28, 8, 15,7, 20,5), and the minimal pair (15,7), the target 
is 2 since 28 > 15+7> 8. 


The pair (j, k) composed of the position of the left minimal pair and of its target is called 
the scope of the sequence p. Observe that the sequence (pj—1, Pk—1 + Pk, Pj, ---, Pk-2) 
is 2-descending since pj_1 > Pk-1 + Pk > Pj, Pj41- 

The first phase of the algorithm works on a sequence of weights and an associated 
sequence of trees. Let p1,...,pn be the current sequence weights. 
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Combination 
(i) compute the left minimal pair (pp—1, pK) (the position k is the least integer k, 
with 1 < k < n, such that pg- < pk (remember that pn+1 = +00)). 
(ii) compute the target j (this is the greatest integer with 0 < j < k, such that 
Pj-1 > Pk-1 + pk (remember that po = +00)). 
(iii) (a) remove the weights pz_1 and Pk, 
(b) insert pr_1 + pk between pj—ı and pj. 
(c) associate to pk—1 + pz a new tree t with left subtree t,_, and right subtree 
tk (and with weight pr_1 + pr). 
This step is repeated as long as the sequence of weights has more than one element. 
The result of the algorithm is the tree associated to the single remaining weight. 


EXAMPLE 9.8 Consider the following weights for an alphabet of five letters. 


a b c dee 
25 20 12 10 14 


The initial sequence of trees is 


25 20 12 10 14 
a b c d e 


The left minimal pair is 12,10, its target is 2, so the leaves for c and d are combined 
into a tree which is inserted just to the right of the first tree: 


25 22 20 14 
a b e 


Now the minimal pair is (20,14) (there is an infinite weight at the right end), so the 
leaves for letters b and e are combined, and inserted at the beginning: 


34 25 22 
a 


25 (22) 20) [14 


12 10 
c d 


Finally, the two remaining trees are combined: 


Version 22 janvier 2008 J. Berstel, D. Perrin and C. Reutenauer 


9. Optimal prefix codes 153 


25 (22) 20) [14 


12 10 
c d 


The tree T’ obtained at the end of the first phase is not ordered, in general (in the 
example, the sequence is a,c,d,b,e). The two next steps compute a ordered tree in 
which the leafs of the letters have the same level as in the (unordered) tree T”. 


Level assignment Compute, for each letter b in B, the level of its leaf in the tree 
T. 

This operation is quite simple and is done in a tree traversal The final step is the 
following. 


Recombination Construction of an ordered tree T in which the leafs of the letters 
have the levels computed by the level assignement. 


EXAMPLE 9.9 The prescribed levels for the letters of the example are: 


abcde 
22 3 3 2 
The optimal ordered tree with these levels is: 


Qa 
g~“ 
o 


Let T be some binary tree with n leaves labelled by the letters b1,...,bn of the 
alphabet B, with weights pi,...,Pn. we denote by £T (or simply ¢;) the level of the leaf 
of b; in T, that is the length of the code word coding the letter b;. Each of the partial 
trees constructed in the algotihm will be identified with its root, considered as a leaf. 
The leaf corresponding to the letter b; will we denoted by Aj. 

We first state two simple lemmas. 


LEMMA 9.1 Let T be some binary tree. If €; > 041, then A; is a right leaf. Symmetri- 
cally, if li < €j41, then A; is a left leaf. 
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Proof. Assume indeed that A; is a left leaf. Then its right sibling is a tree containing 
the leaf A;41. Thus 4; < 41. m 


The following statement is a first step to the proof of the correctness of the algorithm. 


LEMMA 9.2 Ifpi—1 > pi+1, then li < li+ı in every optimal ordered tree. If pPi—-1 = Pi+1, 
then li < €;41 in some optimal ordered tree. 


£,-1 
kiji Ai+1 R 
Ài Ai+1 
(a) Before reorganization (b) After reorganization 


1 
Figure 2.38: Reorganizing leafs in Lemma ba 
Proof. Suppose p;—1 > pi+1, and consider a tree T with 4; > €;,1. In this tree, the leaf 
A; is a right, chil by Lemma and its left sibling is a tree L with weight p(L) > p;-1, 
see Figure fae Build a new tree T” as follows: replace the parent of L by L itself, 
replace the leaf of A;,; by a node having as childs the leafs A; and A;41. The difference 
of the costs is 


Cr — Cr = —p(L) + piy1 — pills — G41 — 1) < Pi+1 — Pi-1 


because 4; > €;41 +1. If pj_-1 > pi+1, then this expression is < 0 and T is not optimal. 
If pi-1 = pi41 and if T is optimal, then T” is also optimal, and er =f. n 


Observe that the symmetric statement also holds 


COROLLARY 9.3 Ifpi—-1 < pi+1, then li—1 > 4 in every optimal ordered tree. If pi—1 = 
Pi+1, then £;-1 > 4; in some optimal ordered tree. 

1 . 
We use Lemma Bin the following form 


COROLLARY 9.4 If the subsequence (pj—1,..., pk) is 2-descending, then Lj < -+ < & 
in every optimal ordered tree. n 


We now show that we always may assume that the minimal tree for a sequence p has 
some special form. Such a tree will be called flat. 


PROPOSITION 9.5 Let (j,k) be the scope of the sequence p = (p1,..., Pn). There exists 
a minimal tree for p satisfying lk—ı = lk and one of the two conditions 
(a) fy=lj+1 or 
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(b) l = L and dj; is a left leaf. 


Proof. Since the sequence (p1, ..., pk) is vila Bae po = +00), one has 44 < 
ly < +++ < Lp in every minimal tree by Corollary 9-4) Next pk—-1 < presi. IÉ Pk-1 < peat 
then ¢,-; > lk in every minimal tree, and if pk-1 = Pk+ı then lk-1 > k in some 
minimal tree. Thus f%—1ı = lk in some minimal tree. Consider this tree. We prove that 


s+1 s s+1|is+2 


t-1 t)j/t+1 


t t+1 
(a) Before the shift (b) After the shift 


3 
Figure 2.39: Proof of Proposition bo 


Lj = & or lj = lk — 1. Assume the contrary. Then £; < lk — 2. Let s be the greatest 
index such that ls < l —2. Then s < k —1 because ¢,_; = lk. Let t be the smallest 
index such that 4 = k. Then 


bj St L Ls < bopi Ss Se < ht SH Hh 


It is quite possible that s + 1 = t. Observe that As+1 is left leaf by Lemma Br because 
ls < 541. Similarly, A; is a left leaf, and_. apd At+1 are siblings. We now make 
the following transformation, see Figure Bae Deaf A; is replaced by a node with the 
two siblings A, and A,41. Each of the leafs A,49,...,A#-1 is shift to the left. The 
leaf Az replaces Az-1, and the parent of Az41 is replaced by A441 itself. The extra 
cost of this transformation is at most ps — pt — pr4+1 because the level of As increases 
by 1, the level of A,41 does not increase, the levels of Ay and A441 decrease by 1. Now 
Ds — Pt — Pt+1 < Ps — Pk-1 — pk because pi + pr4i > pk-ı + pk (equality is possible 
because one might have t = k—1, and the extra cost is < 0 because j > s and therefore 
Ps < Pk-1 + pr. This yields a contradiction and shows that ¢; > ¢, —1. It remains to 


j-1 


(a) Before the transforma- (b) After the transformation 
tion 


3 

Figure 2.40: Second transformation in Proposition por 
consider the case where £; = l4. Arguing by contradiction, assume that A; is a right 
leaf. Then, since 0;_1 < ¢;, the leaf A;_1 : a eft leaf and is the sibling of Aj. The make 


the following transformation, see Figure | Replace the common parent of A;_; and 
Aj by Aj—1, shift Aj,..., A,—2 one position to the right, and replace the leaf A% by a node 
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with children Az_; and Ax. Since the leafs A;_1,..., Ag have the same level before the 
transformation, the extra cost is —pj—1 + pz—1 + Pk. This value is < 0 by the definition 
of the target. Since the tree was minimal before the transformation, the tree after 
transformation has the same cost. In this new tree, one has indeed ¢; = 1+ ¢;. f 


A tree T for p is k-minimal if it is minimal among all tree where the leaves for pk—1 
and px are siblings. 

A level preserving permutation o of tree T is a tree T° that has the same leaves than 
T at the same levels. By definition, the cost of T is equal to the cost of T. 


LEMMA 9.6 Let p= (pi,.--,Dn) be a sequence of weights with scope (j,k) and let T be 
an optimal flat tree for p. Let 


p = (pı, ---,Pj—1,; Pk—1, Pky Pj, Pj+1,;---, Pk—2, Pk+1;- Pn). 


There exists level preserving permutation that transforms T into a tree T’ for p' such 
that the leaves for pp_1 and pg are siblings. 


Proof. Since T is flat, 0; = £p or lj = lk — 1. If ¢; = lk, one makes a circular shift 
of the leaves \;,...,Ax two positions to the right. Since \; was a pi child, before the 


shift, the leaves Az,_ 1 and Xx are siblings after the shift, see Figure : 3 = &—-1, 
j k—-1||k k-11} fkl lj k-2 
(a) Before the circular shift (b) After the circular shift 


Figure 2.41: The case 4; = €,. 


let s be such that £; = Lj, €s41 = lk. Then one first makes a Bee of the leaves 


Às+1,.--, Ak two positions to the right, as before, see Figure ; 
j 5 D j s R 
s+1 k—1||k k-1] fk] [s+1][k-2 


(a) Before the first shift (b) After the first shift 


Figure 2.42: The case lj = lk — 1: A circular shift. 


Then one applies a circular shift, one position to the right, of HS sequence Aj,...,A— 
s, x, where x is the parent node of A,_1 and Ag, see Figure is is a transformation 


that preseves levels of leafs and therefor the resulting tree has the same cost as the tree 
T we started with. 


THEOREM 9.7 Let p = (pı,...,Pn) be a sequence of weights with scope (j,k) and let 
Dp = (p1, +++) Pj—1,Pk—-1 + Pk, Pj, Pj+1, +++) Pk—2; Pk+1;»+ > Pn). Let T be a minimal tree 
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k-1 k k-1 k 


(a) Before the second (b) After the second shift 
shift 


Figure 2.43: The case £; = £k — 1: The second shift. 


for p, and let T’ be the tree obtained by substituting a tree with two leaves \p_1 and Ak 
to the leaf corresponding to pk—-ı + pk in T. There exists a minimal tree T for p of cost 
c(T) = c(T") which is obtained by a level preserving permutation of T". 


Proof. Let T be an optimal tree for p. Since c(T’) = c(T) + pp_1 + pr, the tree T’ is 
k-minimal for 


p = (p1, ---Pj—1;Pk—1; Pk, Pj, Pj+1;- -+ Pk-2; Pk+1;--- Pn) . 


If 7 -1 = k — 2, then p' = p and there is noting to prove. Otherwise, observe that 
sequence 
Pj-1,Pk—1 + Pk, Pj, Pj+1, -+ + Pk-2 


is a 2-descending factor of the sequence p because pj—_1 > pk-1ı + Pk > pj and pk-1ı + 


Pk > pj4i1. Therefore, denoting by g the leaf in T with weight pp; + pk, one has 
5 > á is 
ix i Le A 9 by Corollary Pat The node zx is also the parent node of the leafs 


for pp—1 and px in T’, and since ef = (T for all nodes of P, one has £, < 4j < +--+ < lk-2 
in T”. 


k-1 k k-1 k 


(a) Before the shift (b) After the shift 


| en 
Figure 2.44: The case ¢, = lk—2 in Theorem i 


We distinguish two cases. If l} = lk- then one makes the following transformation: 
the nodes x, A;,...,Ax—2 are cyclically permuted one position to the left, yielding, the ee 
nodes A;,...,Ax—2,2 and therefore the leaves Aj,...,Ax—2, Ak—1, Ak, See Figure PAP 
The resulting tree S verifies c(T) = c(T’) and the permutation is level preserving. 


ORE 8 j s 
k-1 [k-1] [k]s+1 
k-2 k-2 
(a) Before the first shift (b) After the first shi 


Figure 2.45: The case ¢, < ¢,_9: first transformation. 
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If ln < Aveo let s such that l = ls < 541. Then a first transformation (see 


Figure similar to the previous one but on z,...,A,5 gives a tree where the leave 
sequence is Aj ...,As—1, Ak—1; Ak, As+1; <- -, Ak—2- One has fp_1 = bk < l5410--+ < lko. 
A circular permutation by two positions to the left of the eaves Aye—1 Ak, Asts ++ +s AR—2 
yields the sequence A 5+1,---,Ak—2; Ak—-1; Ak, See Figure 5 
: a 
k-1 k s+l s+l S+)s+3 
k—2 k 
(a) Before the second shift (b) After the second shift 


Figure 2.46: The case ¢, < lk—2: second transformation. 


By Lemma Best below, the cost of t ç resulting tree S' is less than the cost of T’ unless 
lk-2 = lk. But in view of Lemma I.0, c(S) cannot be strictly less than ce(T’). n 


LEMMA 9.8 Let m > 3, let lı = b2 < --- < Lm be integers and let (p1, p2,..., Pm) be a 
2-descending chain. Set 


C = Pm-1b1 + Pml2 + pilz +`- + Pm-2bm , 
d = pihi + polo ++: + Pmlm- 
Then c < c, and equality holds only if bm = 4. 


Proof. If m = 3, then 
c — c= (pı — ps)(41 — £3) < 0 


and indeed c’ = c only if 4 = %3. 
If m > 4, then 


d — c= pi (b — L3) + po(lo — 44) +++ + Dm—2(lm—2 — Lm) 
+ Pm-1(lm-1 _ l) + Dm(lm = Lla) : 


Since (p1, p2,---,DPm) is 2-descending, the m — 2 first terms of this sum may be grouped 
and bounded. If m is even 


d — c < pm—3(41 — lm—1) + Pm—2(l2 — lm) + Pm—1(lm—1 — £1) + Pm (lm — £2) 
= (Pm 3 — Pm 1)lm = lı) F (Pm-2 — Pm)(lm = La) <0 


and equality holds only if /m-1 = 4 and fm = %2, so only if 4 =--- = lm. Similarly, 
if m is odd, and because ¢; = £2, one gets 


c —cx< Pm—2(l1 = lm) + Pm-3 (b2 = Lm-1) + Dm—1(lm—1 = £1) + Pm(lm = b2) 
= (Pm-3 — pm-1 (4 _ Cet) F (Pm—2 — Dm) (1 = Lin) <0 


Again, equality holds only if 4 =--- = lm. n 
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10 Exercises 


. section2.1 
Section 


1.1 Let A be a finite alphabet, and let P be a prefix-closed subset of A*. Show that 
P is infinite if and only if there exists an infinite sequence (pp)n>1 of elements in P 
such that 


Pi < p2 <p3<-°: 


1.2 Let A be a finite alphabet of k letters and let X C AT be a prefix code. Show 


that 
> Ank "<1. 


n>1 


tion 


g section2.2 
Section 


2.1 Let X C AT be a prefix code. Let P = XA- and let 


dr.B 
a aa an elementary proof of Corollary MeS ior prefix codes. See also Proposi- 


A= (P,1,1) 
be the literal automaton of X*. Consider an automaton 
A=(Q,i,i) 


which is deterministic, trim, and such that X* = Stab(i). Show that there is a surjective 
function 


pP : P — Q 
with p(1) = i and such that for a € A, 
plp: a) = p(p)-a. 
section2.3 

Section B 

3.1 Let X C A* be a finite maximal prefix code and P = X A~. Show that 

Card(X) — 1 = Card(P)(Card(A) — 1). 

3.2 Let A be an alphabet, and let M(A) be the monoid of prefix subsets of A* 
equipped with the induced product. Show that M(A) is a free monoid and that the 
set of pie (esp, recognizable) prefix sets is a free submonoid of M(A). (Use 


Exercise [E.S and set A(X) = minzex |z|). 
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i section2.4 
Section 


4.1 Let A = {a,b}. Associate to each maximal prefix set Z C A*, a word y(Z) € A* 


as follows: 
(H=! 


and if Z = aX UbY, 

(Z) = ay(X)by (Y). 
Let D C At be the Dyck code and Da = D N aA*. Show that y is a bijection of the 
set of maximal prefix sets onto Dž and further that 


|y(Z)| = 2 Card(Z) — 2. 


Derive from this that the number of maximal, þi aly prefix sets with n elements is 
Card(D* N A?"-*) = (1/n) Cy) (cf. Example 


n-1 
4.2 Let X and Y be two prefix codes over A, and 
P=A*\XA*, Q=A*\YA*. 
Set R= PNQ. Show that there exists a unique prefix code Z such that 
Z=RA\R. 


Show that 
Z=(XNQUANY)U(PNY). 


Show that if X and Y are maximal prefix sets, then so is Z. 


4.3 Let A be a finite alphabet. Show that the family of recognizable maximal prefix 
codes is the least family F of subset of A* such that 


(i) AEF, 
(ii) if X,Y € F and if X = X1 U Xə is a partition in recognizable sets X1, X2, then 


Z=X,UX2Y EF. 
(iii) if X € F and if X = Xı U X2 is a partition in recognizable sets, then 
=x ner, 


p section2.5 
Section 


5.1 Let X C A* be a prefix code. Show that the following conditions are equivalent. 
(i) AX = X*. 
(ii) X is a semaphore code, and the minimal set of semaphores 


S=X\A'X 


satisfies 
SANA S=SA*SUS. 
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Note that for a code X = A*w \ A*wA?, the conditions are satisfied provided w is 
unbordered. 


5.2 Let J C At be a two-sided ideal. For each x € J, denote by ||2|| the greatest 
integer n such that 
reJ” 


and set ||x|| = 0 for x J. Show that, for all x,y € A* 


lell + llull < lleyll] < llel + llull +1. 


: section2.6 
Section 


6.1 Let X C At be a finite maximal prefix code. Show that if X contains a letter 
a € A, then there is an integer n > 1 such that a” is synchronizing. 


6.2 Let A be a complete determinstic automaton with n states. Show that if A is 
synchronized, there exists a synchronizing word of length at most n? in A. 


6.3 Let n > 1 be an integer and let M be the monoid of mappings from Q = Z/nZ 
into itself generated by the two maps a,b defined for i € Q by ia = i + 1 and 


pa ait sicnt); 
10 = 
i+1 (n-—t<i<n) 


for some integer t with 1 < t < n. In the case where t divides n and ib =i+1 mod d 
for all 2 € Q, the minimal rank d of the elements of M is t. The aim of this exercise is 
to show that this is the only possible case when d > 1. 

For each e, f withO<e< f <n, let Ief ={e,e+1,...,f—1} and let Mar = {m E 
M | Qm = Ie ș and im = i for all i € Te f}. 
(a) show that for each j € Q 


Te paf = le+j,f+j and a4 Me, paf z Me+j,f+j . 


(b) show that Mo, is not empty (hint: show that ba~! has a power in Mn—t,n). 

(c) let d be the least integer such that Mo,q is not empty. Show that Mo,a is formed of 
one element m such that i- m =i mod d for all i € Q (hint: arguing by contradiction, 
let j be the least integer such that j -m Æ j mod d. Use a’—4m to show that one may 
reduce to the case j = d. Then show that some power of ma fixes an interval of less 
than d elements). 

(d) show that d divides n (hint: let n = dq +r with q > 1 and 0 < r < d. Show that 
some power of a”™"m is in M,). 

(e) show that ib =i+1 mod d for each i € Q. 


6.4 Let X be a maximal prefix code on the alphabet A = {a,b}. Let a” € X and let 
Y = XNa*ba*. Set Y = {y0, y1, .-., Yn} with y; = aba. Suppose that 
(1) there is an integer m > 1 such that a” is not a factor of a word in X. 
(2) for each i, we have |y;| < n with equality if and only ifn—-—t<i<n-l. 
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(3) the lengths of the words of Y are relatively prime 
-synchro 
Show that the code X is synchronized (hint: use Exercise ae 


6.5 Let X C AT be a prefix code and let X = Y o Z be its maximal decomposition. 
Show that if X = Y’ o Z' with Z’ prefix and Y’ maximal prefix, then 


POCZT 


. section2.7 
Section 


7.1 Let X C A* be a maximal prefix code, and let a: X —]0, 1] be a function such 
that 
iD a(x)=1. 
xEX 
Show that there is a cylindric measure on A* such thatr (x1 --- £n) = a(zı)---al£n) 
for £1,..., 2n E X. (Set (p) = X pucx a(pu) for p € XA.) 
7.2 Let X C At be a thin maximal code and let m : X —]0, 1] be a function such 
that 
`> meal, 
rEX 
Define the entropy of X (relatively to 7) to be 


H(X) = — J ` n(x) log, (2), 


xEX 


where k = Card(A). Set A(X) = X zex læl|r(z). 
Show that 
H(X) < A(X), 


and that the equality holds if and only if 
n(x) =k-Pl for tex. 
Show that if X is finite and has n elements, then 


H(X) < log, n. 


. section2.Tbis 
Section 


8.1 Show that u = (un)n>1 is the length distribution of a thin maximal prefix code 
on k letters if and only if 


(i) 
`» unk” =1, 


n>1 


(ii) there is an integer p > 1 such that the sequence v = (Vn )n>0 defined by u(z)—1 = 
u(z)(kz — 1) satisfies Un+p < Un(k? — 1) for all n > 1. 
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(Hint: show that if condition (ii) is satisfied, then u is the length distribution of a 
maximal prefix code X such that a?? is not a factor of the words of X). 


8.2 Show that the sequence u is the length distribution of a synchronized thin maximal 


prefix code if and only if it satisfies the additional condition 


(iii) the integers un are relatively prime. 


i . onetSynchro 
(Hint: use Exercise 
g section2.9 
Section 


9.1 Prove that the code produced by Varn’s algorithm is indeed optimal. Show that 
in the case n Æ 1 mod k—1, the algorithm proposed is correct, and can be implemented 
in time O(n logn). 


11 Notes 


The results of the first four sections belong to folklore, and they are known to readers 
familiar with automata theory or with trees. 


10 
The hypothesis of unambiguity is necessary in Proposition Fra tsce (L987). 
Semaphore codes were introduced in (L964) under the name of J 


codes. All the resul presented in Secti n f can be found in that paper which also 
contains Theorem 6.5, and Proposition ae 


The notion of synchronized prefix code has been extensively studied in the context 
of automata theory. Let us mention Cerny’s problem: given a complete deterministic 
automaton with n states which is synchronized, what is the least ypp gr „bound to the 
length of a synchronizing word as a function pf n? See Exercise FSC Kizord (L950), 
(1964), and Pin LITS): g-xample eee ntained by a construction of 
(1977al) (see Exercise [[.9). The synchronization properties of a special class of prefix 
codes called incl sion. codes have been studied ee (L969) and 
(1972). Exercise 6.4) 1s due to (L967). The maximal decomposition of 
prefix codes and Propositions pb. 1s due to (L972). 

The results of Section [// are given in another terminology in (L968). The pair 
formed by a prefix code X and a cylindric measure p such that p(xy) = p(x)p(y) for 
all z,ye X*d fines what Feller calls a recurrent event. ete Se 

Theorem paom (2000). The method used in the proof of Lena 
has been used before in symbolic dynamics (see (L979) or (L983) ). 


The transformations between the various weighted automata recognizing a given series 
used in the proof of the theorem have been systematically studied in (2005). 


Huffman’s algorithm (Huffman, is presented in most textbooks on algorithms. 
It has numerous applications in data compression, and variations such as the adaptative 


Huffman algorithm have been developed, see (L985). 
Run-length limited codes have applications in practical coding, see 
(1995). 


The case of code words with equal weights and unequal letter cost has been solved 


by (L971). Another algorithm is (Perl et al.) 1975). 
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Karp (Karp, (1961) gave the first algorithm yielding a solution to the general problem 
with integer costs. His algorithms reduces to a problem in integer programming. 

Another approach by uses dynamic programming. Their 
algorithm produces the solution in time O(n"*?), where n is the number of code words 
and « is the greatest of the costs of the letters of A. This algorithm has been improved 


to O(n") in the case of a binary alohabet, in (Bradford et al. 2002). 


Ordered prefix codes are usually called alphabetic trees. The use of dynamic program- 


ming technique for the construction of optimal alphabetic trees goes back to 

(1959). Their algorithm is O(n?) in time and O(n?) in space. (L971) 
reduces time to O(n?). 

We follow for the exposition and the proof of the Garsia-Wachs algo- 


rithm (see also (i977); (9), and (E955) for 


the drawings. The Garsia-Wachs algorithm is simpler than a previous algorithm given 


in [Hu and Tuckey} ((L971}) which was also described in the first edition of Knuth’s book. 


For a proof and a detailed description of the Hu-Tucker algorithm, and complements 
see (R002); (L998). 
There is no known polynomial time algorithm for the general problem, nor is the prob- 
lem known to be NP-hard. A polynomial time approximation scheme, that is an algo- 
rithm that produces a solution which is optimal up to 1+e€ in time O(n log n exp(O( + log +))) 


ie |r 
Exercises þ.2, are from (L998). 
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Chapter III 


Deciphering Delay 


0 Introduction 


This chapter is devoted to codes with finite deciphering delay. Intuitively, codes with 
finite deciphering delay can be decoded, from left to right, with a finite lookahead. 
There is an obvious practical interest in this condition. Codes with finite deciphering 
delay form a family intermediate between prefix codes and general codes. There are 
two ways to define the deciphering delay, according to whether one considers symbols 
in the alphabet or code words. The first one is called literal delay, and the second one 
verbal delay, or simply delay for short. 

The first section is devoted to codes with finite (verbal) deciphering delay. We present 
first some preliminary material. In particular we prove a characterization of the de- 
ciphering de ay jn terms of simplifying words. We prove Schtitzenberger’s theorem 
(Theorem saying that a finite maximal code with finite deciphering delay is prefix. 

We prove that any rational code with finite deciphering delay is contained in a max- 
imal rational code with the same delay. 

The second section considers the literal deciphering delay, that is the deciphering 
delay counted in terms of letters instead of words of the code. This definition is more 
useful from the practical point of view. We introduce the notion of automata with finite 
delay. We prove that for finite codes, it is equivalent to have finite literal deciphering 
delay and to be recognized by an automaton with finite delay. We use this character- 
ization to give yet another proof of Schtitzenberger’s theorem. Next, we show that a 
rational completion with the same delay exists. 


1 Deciphering delay 
A subset X of At. is said to have verbal deciphering delay d > 0 if for all z,x' € X, 
ye X? y E X* 

xy < x'y' implies x = 2’. (3.1.1) 


Recall that we write u < u’ to express that u is a prefix of u’. We usually omit the 
adjective verbal for simplicity. If X has finite deciphering delay, then the smallest 
integer d satisfying Gea. called the minimal deciphering delay of X. This notion is 
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clearly oriented from left to right; it is straightforward to define a dual notion (working 
from right to left). 


Figure 3.1: Forbidden configuration for finite deciphering delay. 


The terminology is justified by the following consideration: let w € A*,be a word hav- 
ing two prefixes in Xt, and such that the shorter one is in X!+4, If fas then 
the two X-factorizations start with the same word in X. Thus, the ”delay” between 
the moment when, reading from left to right, a possible factor of an X-factorization is 
discovered, and the moment when these factors are definitively valid, is bounded. 

It follows from the definition that the sets with delay d = 0 are the prefix codes. This 
is the reason why prefix codes are also called instantaneous codes. In this sense, codes 
with finite delay are a natural generalization of prefix codes. 


PROPOSITION 1.1 A subset X of At which has finite deciphering delay is a code. 


Proof. Let X have deciphering delay d. Assume that X is not a code. Then there is 
an equality 
W = T172: En = Y1Y2 Ym, Tı FY 


i 
with n,m > 1, £1,..., En; Y1;---;Ym E X. Let z € X. Then wz € yı X*. By (ty 
we have xı = yı. Thus, X is a code. m 


EXAMPLE 1.1 The suffix code X = {aa, ba, b} has an infinite deciphering delay. Indeed, 
for all d > 0, the word b(aa)? € X'*7 is a prefix of y(aa)¢ with y = ba Æ b. 


mond .3 
For a set X C A’, define, as in Section ing a sequence (Un)n>0 of subsets of A* by 
setting 
UjeSxX X= 1, Um =X U,00,'x,. w= 1. 


PROPOSITION 1.2 The set X has finite deciphering delay if and only if the set Un is 
empty for some n. 


Proof We use the fact that for n > 1, one has u € Un iff there are 71,...,2j,91,-.-Yj € 
X with xı Æ y1, i +t j =n + 1 and u suffix of y; such that 2, --- xju = yiyo--- yj. We 
first verify that if X has deciphering delay d then Usq4+1 = . Suppose the contrary. 
Let x£1,..., £i, Y1,---Yj E X be such that zı- -xiu = yiy2 -yj with i + j = 2d + 2, 
u suffix of yj and zı Æ yı. Then i — 1 < d — 1 since otherwise xı = yı. Similarly, 
j— 2< d-—1 since otherwise, with y; = vu, we have yy2---yj-1U = £1: x; and thus 
zı = yı again. Thusi+j < 2d+1, a contradiction. Conversely we show that if U, = 0, 
then X has deciphering delay n — 1. Let indeed x, x’ € X, y € X™!, j > 0 and u € A* 
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be such that zyu € 2! X!. Then u € Um with n < m. This forces x = x’ proving that 
X has deciphering delay n — 1. n 


EXAMPLE 1.2 The set X = {a, ab, bc, cd, de} has deciphering delay 2. We obtain U1 = 
{b}, U2 = {e}, U3 = {d}, U4 = {e}, Us = 0. 


We reformulate the definition of deciphering delay as follows. A word s € A* is said 
to be simplifying if for all x € X* and v € A*, 


zsv E X* > sv € X* 


PROPOSITION 1.3 Let X C A* be a code. Then X has deciphering delay d if and only 
if all words of X? are strongly simplifying. 


Proof Let us first suppose that X has delay d. Let x € X%, z1,... ,t,) E X and u € A* 
be such that xı ---x,xv E€ X*. Thus 


Ly LpXLV = Y1: Yq 


for some y1,..., Yq E X. Since X has delay d, it follows that zı = y,...,%p = Yp, 
whence q > p and zv = Yp41°+* Yq. Thus zv € X*. This shows that x is simplifying. 
Conversely, suppose y € X%. Let x,x' € X and u € A* be such that xyu € x'X*. 
Then yu € X*. Since X is a code, this implies x = x’. Thus X has deciphering delay 
d. n 


We now study tion codes, with finite deciphering delay. The following result is 


similar to Proposition [IL 


PROPOSITION 1.4 Let X be a subset of At which has finite deciphering delay. If y € 
A* is an unbordered word such that 


X*yA* nN X* =9 
then Y = X Uy has finite deciphering delay. 


Proof. Consider the set V = X*y. It is a prefix code. Indeed, assume that v = xy and 
v = g'y with z,xz' € X*, and v < v’. Then necessarily v < 2’ since y is unbordered. 
But then 2’ € X*yA*, a contradiction. Note also that 


VIAN X*=0 


since Vt A* C VA*. 
Let X have deciphering delay d and let 


e=d+ly| 
We show that Y has deciphering delay e. For this, let us consider a relation 
W = Y1Y2 + Yeti U = Y1Y2 Yn 
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with y1, .-.,Ye+1:Y4;--- Yh E€ Y, u € A* and, arguing by contradiction, assume that 


yı FY 
First, let us verify that one of y1,...,Ye+1 is equal to y. Assu $ dhe contrary. Then 
Ye Yar1 E XH. Let q be the smallest integer such that (Fig. 


Yio Yar S Y1 YG 


The delay of X being d, and yı 4 yj, one among y,...,y/, must be equal to y. We 
cannot have y; = y for an index i < q, since otherwise y;--- ya41 E€ Vt A*N X*. Thus 
y4 =y and y,---y, E V. Note that 


Yi Yy- SY Yast 
Next, |Ya+2°-*Ye+i] > e — d = |y]. It follows that 
Yio Yg L Y1 Yel: 


But then yi +--+ Yer E X* NX*yA*, which is impossible. This shows the claim, namely, 
that one of y1,...,Ye+1 is equal to y. 


y e+ 
: Yaz: Yori 


Figure 3.2: 


It follows that w has a left factor yiy2--+Yp in V with y1,...,Yp-1 E X and yp = y. 
By the hypothesis, one of y1, ..., y4 must be equal to y. Thus w has also a left factor 
yy yg in V with y1... ,Yg—1 E X and y} = y. The code V being prefix, we have 


YY Yp—1 = YY? Yg 
Since X is a code, this and the assumption yı 4 y; imply that p = q = 1. But then 
Yp = Y = Yq This yields the final contradiction. n 


8.2 
Proposition as the following interesting consequence. 


THEOREM 1.5 Let X be a thin subset of At. If X has finite deciphering delay, then 
the following conditions are equivalent: 

(i) X is a maximal code, 

(ii) X is maximal in the family of codes with finite deciphering delay. 


Proof. The case where A has just one letter is clear. Thus, we suppose that Card(A) > 2. 
It suffices to prove (ii)=(i). For this, it is enough to show that X is complete. Assume 
the contrary and,co iger a word u which is not a factor of a word in X*. According 
th Proposition pate there exists v € pe that y = uv is unbordered. But then 
A*yA* N X* = Ú and by Proposition y has finite deciphering delay. This yields 
the contradiction. n 
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A word p is strongly right completable (for X) if, for all u € A*, there exists v € A* 
such that puv € X*. Clearly, an strongly right completable word is right completable. 
The set of strongly right completable words is denoted by E(X E 

The following statement is the counterpart of Theorem or codes with finite 
deciphering delay since it shows that maximal codes finite deciphering delay satisfy a 
condition which is stronger than being complete. 


PROPOSITION 1.6 Let X C At be a maximal code with deciphering delay d. Then for 
any « € X? and u € A* there exists a v € A* such that xuv € X*. In other words 


Xic E(X). 
Proof The case of a one letter alphabet js clear. Thus, assume that Card(A) > 2. Let 
x € X? and u € A*. By Proposition B.6, there is a word v € A* such that y = zuv is 


unbordered. This implies that 
X*yA*NX* #0 


.8.2 
Indeed, otherwise X U y would be a code by Proposition f zi contradicting the maxi- 
mality of X. ere 
. RA -simplifyin 
Consequently, there exist z € X*, w € A* such that zyw € X*. By Proposition E 
x is simplifying; thus, zyw = zxuvw € X* implies ruvw € X*. This shows that x is 
strongly right completable. 7 


We now state and prove the main result of this section. 


THEOREM 1.7 Any finite maximal code with finite deciphering delay is prefix. 


In an equivalent manner, a maximal finite code is either prefix or has infinite deci- 
phering delay. 
Proof We argue by contradiction and suppose that X is not a refx code. Denote by 
P the set of prefixes of the words in X*. Define (see Figure pao 


T ={tE€ P |3z,y € X, «#y and ctA* NyX* nonempty}. 


We first observe that T contains the empty word. Indeed, since X is not a prefix 
code, there exist x,y € X with y = xu for some u € AT; thus zA* N {y} is nonempty; 
this shows that 1 € T. Thus T is not empty. 


Figure 3.3: 


We show that T is finite. Let L be the maximum length of the words in X. Suppose 
by contradiction that there exists t € T of length |t| > dL, where X has deciphering 
delay d. Thus t = z1 +- zat for £1,..., £4 E X and t € P. 
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Since ztA* N yX* is nonempty, we have ztu = yw for some word w € X*. Conse- 
quently xa1---agt'u = yw, and since X has delay d, we obtain x = y. Therefore t 
cannot be in T and T is finite. 

We consider now some t in T of maximal length. We have, for some x,y E€ X, £ Æ y, 
that ztA* NyX* is nonempty; hence ztu € yX* for some word u, and we may suppose 
that u € At. Set u = au’, where a the first letter of u. We are going to show that 
ta € P, which ants ta € T, a contradiction. 


Set w wE z is a word of maximal length in the — code X. By 
Proposition fh * MN X* is none cuhpgsyetore there are £1,..., n, Y1,- --; Ym 
in X and v in A a that (see Figure pp 


L1 Ly ztav = yi: Ym. 


£1 Tn zZ t 


yı Yn 


Take n minimal. If n > 1, we have zı (z2--- £nzt)av = y1 +++ Ym and t = T2- Lnzt € 
P, since t € P. Thus xt’ A* intersects y1 X*, and since t ¢ T, we must have xı = y1. 


Thus zə- znztav = yo°-- Ym É this contradicts the minimality of n. Hence n = 0 


and ztav = y1---Ym (see Figure 


t 
yı l Yn 


Figure 3.5: fig-thMPS3 


Note that, since z is of maximal length, yı is a prefix of z. Suppose by contradiction 
that yı Æ z. Then for some prefix tı of y2:--Ym, we have yıtı = zt. Since t € P, 
the set yıt1 A* intersects zX* and we conclude that tı € T, a contradiction since 
lul < |2| = |ti] > ltl. 

Thus yı = z and tav = y2: -- Ym. Hence ta € P, which was to be proved. n 


-8.4 
The following examples show that Theorem is Optimal in several directions. 


EXAMPLE 1.3 The suffix code X = {aa, ba, b} is a finite maximal code and has infinite 
deciphering delay. 


Version 22 janvier 2008 J. Berstel, D. Perrin and C. Reutenauer 


1. Deciphering delay 171 


EXAMPLE 1.4 The code {ab, abb, baab} has minimal deciphering delay 1. It is neither 
prefix nor maximal : indeed, the word bbab, for instance, can be added to it. 


EXAMPLE 1.5 The code X = ba* is maximal and suffix. It has minimal deciphering 
delay 1. It is not prefix, but it is infinite. 


. . . .5.1ter 
The rest of this section is devoted to the proof of an analogue of Theorem BS for codes 
with finite deciphering delay. 5 Lhe following example shows that the construction used 
in the proof of Theorem ES does not apply in this context. 


.5.4b 
EXAMPLE 1.6 Let X = {a,ab}, A = {a,b} and y = bba as in Example Fone pis. tee 


Y = X Uy(Uy)* with U = A* \ X* \ A*yA* constructed in the proof of Theorem 
is a maximal code but is has infinite deciphering delay. Indeed, the word y’ = ya“bby 
is in Y and has the proper prefix ya? in Y™t, 


THEOREM 1.8 Each rational code having deciphering delay d may be embedded into a 


maximal one with the same delay d. 


Let X be nonempty code with deciphering delay d. If d = 0, X is prefix and the 
result is easy: let L be the set of preper prefixes of words in X, and let L = A* \ L be 
its complement. Let X’ = L\ LAt. Then Y = X U X' is easily seen to be a maximal 
prefix code containing X. If X is rational, the nY is rational. 

We assume in the sequel that d > 1. Let Q be the set of words having no prefix in 
X and which are not a factor of any word in X. Now, let P be the set of words in Q 
which are minimal for the prefix order: P = Q\QA*. Note that P is a prefix code. 
Moreover, words in P and X are incomparable for the prefix order. 

We say that a pair (w,p) € X* x P is admissible if w is the longest prefix in X* of 
wp. We also say that this pair is optimal if (uw, p) is admissible for any u € X*. Note 
that if (w, p) is optimal, then so is (uw, p) for u € X*. We let S” be the set of words v 
of the form v = wp with (w, p) admissible but not optimal. Then we define S = PU S’. 
Moreover, let R be the set of words v of the form v = xwp with x € X, w € X*, (xu, p) 
optimal and (w,p) admissible but not optimal. Then we define 


Y = XU RS. (3.1.2) |eq-Y 


PROPOSITION 1.9 Y is a code with deciphering delay d. 


The proof relies on a series of lemmas. 


LEMMA 1.10 If (m,p) is admissible but not optimal, there exists x' Æ x" in X, a 
factorization p = pıp2 with pı £ €, and w,v E X* such that 


x'wmp = x" vpo. 


Proof. Since (m, p) is not optimal, we may find w’,v’ € X* and a factorization p = pıp2 
with pı Æ € such that w'mp = v'p2. Choose such a relation of shortest length. Then 
w’ is nonempty, since (m, p) is admissible and v’ is nonempty because |p| > |p2|. Thus 
w = x'w, v! =2"v with w,v € X, x' x" € X. Necessarily, x’ Æ x” by minimality. n 
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p2 


Figure 3.6: An admissible but not optimal pair 


LEMMA 1.11 The set S N X4A* is empty. 


Proof. Suppose that s = ut with s € S, u € X? and t € A*. Note that s ¢ P otherwise 
some element of X is a prefix of some element of P (since d > 1), contradiction with 
the definition of P. Hence s = mp, with (m, p) admissible but not optimal. We have 
mp = ut and u cannot be longer than m, since ( Q, p) is admissible. Thus m = um’ with 
m’ € A*. Then we can find by Lemma fa Tx" in X with x2’ Æ x”, a factorization 
p = pip2 with py # € and w,v € X* such that 2’wmp = x" vpo. 

Thus z”vpo = z/wum'p pe and it follows that 2”v = x'wum’'pı, which contradicts 
the fact that X has deciphering delay d, since v € X* and u € X®. n 


LEMMA 1.12 Let u,v E€ X*, r = mp E€ R with (m, p) optimal. 
(i) ur cannot be a prefix of v. In other words, X* RA* N X* is empty. 
(ii) Ifv is a prefix of ur, not shorter than um, then v = um. 
(iii) If um is a prefix of v and if ur and v are comparable for the prefix order, then 
um =v. 


Proof. (i) Suppose that urt = v for some t € A*. Then umpt = v. Since p is not a 
factor of any word in X, we find, by decoding v € X*, that p = pıp2 with p1, p2 Æ € 
and ump, € X*, a contradiction with the fact that (m, p) is optimal. 

(ii) We have ump = ur = vt with t € A*. Since |um| < |v|, v ends with p: p = pıp2, 
ump, =v. Since (m, p) is optimal, we must have pı = € and v = um. 

(iii) Since ur and v are comparable, one of them is a prefix of the other. By (i), v isa 
prefix of ur. Since um is a prefix of v, (ii) applies, and we find v = um. n 


LEMMA 1.13 Letv € X* and let s = mp E€ S with (m, p) admissible. 
(i) s cannot be a prefix of v. In other words, SA* O X* = 9. 
(ii) Ifv is a prefix of s, not shorter than m, then v = m. 
(iii) If m is a prefiz of v and s,v are comparable for the prefix order, then m = v. 


Proof. (i) Suppose that v = st for some t € A*. Then v = mpt. Since p is not a factor 
of any word in X, we have p = pıp2 with p1, p2 4 € and v = mp1. This contadicts the 
fact that (m, p) is admissible. 

(ii) Suppose that mp = s = vt for some t € A*. Since |m| < |v], we obtain p = pıp2 
with v = mp. Since (m, p) is admissible, we must have pı = € and v = m. 

(ii) One of s and v is a prefix of the other. By (i), it must be v which is a prefix of s. 
Sice m is a prefix of v, (ii) applies and we fid m = v. n 
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LEMMA 1.14 The set X*R is a prefix code. 


Proof. Suppose that u,u’ € X*, r,r’ € R and ur is a prefix of u'r’. We write r = mp, 
r' = m'p', where (m, p), (m’,p’) are optimal. Then ump is a prefix of u’m’p’. Hence um 
is a prefix of u'm’ or conversely. Moreover, ur and u'm’ are comparable, and s are uty 
and um (since all these four words are prefixes u'r’). Hence, we find by Tema PEE 
that um = u'm’. Thus p is a prefix of p’. Hence p = p’, since P is a prefix code. a 


LEMMA 1.15 We have 
(i) SA* A X*RA* = 9. 
(ii) SA*NY* =9. 


Proof. Let s € S, r € R and v € X* be such that s and vr are comparable for the 
prefix order. We cannot have s € P since vr € X* A*. Write s = mp, r = m'p’ where 
(m,p) is an admissible pair but not optimal and (m’,p’) is optimal. Then m and um’ 
are comparable. Ss 

If vm’ is a prefix of m, since vr and m are comparable, Lemma fore cin) shows that 
um! = m. If 2 the contrary, m is a prefix of vm’, since s and vm’ are comparable, 
Lemma (Eye shows that m = vm’. So, we obtain that m = um’ in both cases. 
Since s = mp, vr = vm'p’', we find that p,p’ are comparable. Thus p = p’, since P is a 
prefix code. We conclude that s = vr. 


Since (vm’,p) = (m,p) is not optimal, we reach a contradiction with the strong 
optimality of (m‘,p’ = {m', p). 
(ii) By Lemma 1), SA* N X* = Ø. Since Y = X U RS*, we see that Y* C 
X* U X*RA*, so that (i) shows that SA*NY* =% z 


it is then necessarily a code by Proposition y contradiction, suppose that Y has 
not deciphering delay d. We may find words y1,...,Ya41, 21,---;2n in Y, w € A* such 
that 


-P1 . 
Proof of Proposition EF we only have to “EE fhat Y has deciphering delay d, since 


Y1Y2° °° Yd+1W = 21 `: Zn (3.1.3) 


with y; Æ z1. Without loss of generality, we may assume that |w| < |zn| (otherwise, zn 
is a suffix of w and we may shorten the relation by simplifying by zn). 


Since X has deciphering delay d, not all of y1,...,ya4i, 21,---;%n are in X. Moreover, 
if the zj are all in X, then some y; is in Y \ X, hence in RA*, and we choose i minimal. 
Then y1: yayı w E X*RA* and z1--+Z, E€ X*. This contradicts Lemma 1). We 


conclude that some z; is in Y \ X C RS. 
Suppose now that all y; are in X. By the length assumption on w, we have yi... Yai1 € 


Z1 +++ Zn-1A*. If one of z1, sgt 2n-1 is in Y \ X, then y1: -- yay1 E X* N X*RA*, which 
contradicts Lemma yee Thus z1,...,2n-1 E€ X and zn E€ Y \ X. Since zn € RS", 


we may write zn = cupm, with xz € X, u € X*, m € S*, (xu,p) an optimal pair and 
(u,p) admissible, but not optimal. 

We have y1: Ydp1W = 21: Zn-ıxupm. Therefore, 21 ---Z,-1vu and yı: Yas 
are com arable, for the prefix order. If z1--- Z,_1xu is a prefix of yı +- ya41, then by 
Lemma ii), they are equal. But y1 +: Yaqui = 21°°+ Zn—1@u implies yı = 21 since 
X is a code, a contradiction. 
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Thus y1 -+ < Yaşı is a prefix of z1 - -> Z,_1xu. Since yı # z1, and since X has deciphering 
delay d, we must have n = 1 and yı = x. Thus y1--- Yq, is a prefix of xu, hence 
Y2 `- Yaşı is a prefix of u, hence of up € S, which contradicts Lemma i I ii 

All this shows that some y; and some zj are not in X, hence are in RS*. Take i 
and j minimal. Then y; = ru, zj = r'u’ with f, r’ € R. Moreover yı- Yi-ır and ie 
z1 +++ Zj—-17" are comparable by Equation . We deduce then from Lemma fes 
that y1 ++ Yir = z217: 2;-17’. We may write r = amp, r' = z'm'p', where (£m, p), 
(x'm',p') are optimal pairs and (m, p), (m’, p') are admissible but not optimal. Then the 
equation y1 +: Yi-1£Mp = 2% -++ zj-1%'m'p' forces by optimality p = p' since y1,..., Yi—1, 
Z1,- ats BE mm are all in X*. Thus y1 + ym = zy- zjarm. i,j 2 2, 
then yı = zı since X is a code, a contradiction. 

It follows from this that we must have i = 1 or j = 1, i.e. yı or zı is in RS™*. 
Suppose that i = 1 and j > 1. Then we obtain zm = z1:-: zj-1%£'m', which shows 
that z = z and m = z---z;-12'm'. We know that the pair (x’m’,p) is optimal. 
Hence (z2--- z;-12'm’,p) is also optimal. Now this pair is equal to (m, p), which is not 
optimal, a contradiction. 

Thus, we cannot have i = 1 and j > 1. Similarly, we cannot have i > 1 and j = 1. ae 
Now, we have i = j = 1, i.e. y1,21 E RS*. Since R is a prefix code by Lemma ere 
we have either y1 = rs1S2, 21 = rsı Or Yi = S11, 24 = 18182 with r € R, 81,52 E€ S*, 
pon simplification by z1, 


Z2*** Zn = S292'**Yd+1W Which contradicts Lemma CIS . Thus the second case 
holds. Again by Equation ( , we have yo--- Yq41W = $221°++ Zn. To avoid the 
same contradiction, we must have that y2-:-Yq+ı is a proper prefix of s2. We deduce 
from Lemma fr that y2,---,Ya+1 are all in X. 


We may write s2 = ss3, where s E€ S, s3 E S*. Since y2 is a prefix of s2 (because 
d > 1), y2 is a prefix of s or vice-versa. Hence s ¢ P and thus s € S’. We deduce 
that we may write s = mp for some admissible but not optimal pair (m,p), and the 
existence of f,n € X*, x, 2’ € X with x £2’ such that snmp = 2’ fq with |q| < |p]. 

We know that y2--: Yayı is a proper prefix of s2 = mps3. Now, m is not a prefix 
of yo-+- Yayı (otherwise, by Lemma |I.T3 Gu). awe, deduce m = Y2: Yayı and mp E S 
has a prefix in Xĉ, contradicting Lemma [11). Thus yo---yg41 is a prefix of m. Let 
m = y2- Ya41g9. Then znyz::-yayıgp = x’ fq and because |q| < |p| and n, f € X*, 
this contradicts the fact that X has deciphering delay d. n 


PROPOSITION 1.16 The set Y is a complete code. 


If X is dense, then Y is dense and therefore is complete. So, we may assume that X 
is a thin code. The proof of Proposition Heelies on the following lemma. 


LEMMA 1.17 If X is a thin code, then the set PU (X \ XAT) is a maximal prefix code. 


Proof Let Z = PU(X\XA*). The two terms of this union are prefix codes. Moreover, 
any word in P is incomparable (for the prefix order) with any word of X. Hence Z is 
a prefix code (since € ¢ Z because X Æ Ú by assumption). 

We show that Z is right complete. Let w € A*. Suppose that w is not comparable 
with X. Choose some word u which is factor of no word in X (such a word exists 
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since X is thin). Then wu is not a factor of any word in X, and has no prefix in X. 
Therefore, wu has a prefix in P and we conclude that wA* N ZA* is nonempty. n 


-P2 
Proof of Proposition FETE Choose some word v € X4. We show that for any word w, 
vwA*NY* is nonempty (this will imply that Y is complete). By contradiction, suppose 
that 
vwA* Nn Y* =Í. (3.1.4) 


We may write vw = y1---Yynu with y; € Y and with u of minimal length amon ay 
such factorizations. Note that v is necessarily a prefix of y1--- Yn. By Lemma BEES 
find p in PU (X \ XAT) such that p and u are comparable. We claim that if pı is a 
prefix of p, then y1---Yynpi ¢€ Y*. Indeed, if y1 ---Ynpı € Y*, then since p and u are 
comparable, either pı is a prefix of u, contradicting the minimality of u, or u is a prefix 
of pı, and this contradicts Equation ES 

By the claim, p is not in X, hence p € P. Choose now i € {1,...,n +1} minimal 
such that yiYi+ı -Yn E X* (i = n + 1 will mean that only the empty product is in 
X*). We show that for any j > i, (YjYj+1**' Yn, p) is an optimal pair. Indeed, if not, 
then p = pıp2 with pı # € and yj':-Ynpı E X*. Take n+ 1 > j > i minimum with 
YjYj+1**'Ynp E S (j exists since p € S). If j > i, then yj;_1y;---ynp € R (indeed 
(Yj—1Yj °** Yn, p) is then an optimal pair, since yj—1Y; +--+ Ynpis not in S). Since RC Y, 
this contradicts the claim. 

Hence j = i. If i > 1, then y;_1 is not in X, hence is in RS*. Then y;_1y;--- ynp € 
RS* (since yi:--Ynp € S), and we find a contradiction with th claim, i 

Thus we are reduced to i = 1 and y,---ynp € S. By Lemma Fates find x, x’ in X 
distinct, such that rX*yı +--+ ynpN z’ X*pə is not empty, for some factorization p = p1p2, 
pı Æ €. Since v is a prefix of y1 +--+ Yn, this contradicts the fact that X has delay d. n 


Note that the above proof implies that a thin code X C At with deciphering delay 
d is complete if and only if for any z € X? an y & A* there is a v € A* such that 
xuv € X*. We thus obtain again Proposition -0 ~ 


PROPOSITION 1.18 If the code X is a rational, then Y is a rational code. 


Proof. Since X is rational, the set F(X) of its factors is rational. Consequently, 
Q = A* \ F(X) \ XA* is rational. Since, P = Q \ QAF, the set P is also rational. 

Let c be a new letter not in A and let m : (AUc)* — A* be the projection that erases 
c. For u,p € A*, we say that the word ucp is admissible (resp. optimal ) if so is the 
pair (u,p). We denote by So (resp S1) the sets of these words. 

Let L = (r~!(X*) mM A*cA*t)A*. Thus L is the set of words starting with a word 
z = ucw with w Æ £ and uw € X*. The set L is rational. We claim that Sp = X*cP\L, 
which implies that So is rational. 

In order to prove the claim, let ucp € So. Then evidently u € X* and p € P. 
Moreover, suppose ucp € L, then there is a factorization p = ww’ such that w Æ € 
and uw € X*, contradicting the fact that (u,p) is admissible. Conversely, if u € X*, 
p € P and ucp ¢ L, there is no prefix of up in X* strictly longer than u. Thus (u, p) is 
admissible and ucp € Sp. 

Similarly ucp € Sı if and only if u € X*, p € P and X*ucpN L = Ø. This implies 
that S1 = X*cP \ (X*)“!L is rational. 
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Figure 3.7: An automaton recognizing Y*. 


Let Ro be the set of words of the form xucp, with x € X, u € X*, which are optimal 
and such that ucp is admissible but not optimal. In other words, Ro = S1 N X (Spo \ $1). 
This shows that Ro is rational. Clearly R = 7(Ro). Recall that S” is the set of words 
of the form up with (u,p) admissible but not optimal. Consequently S” = m (So \ $1). 

This shows that S’ and R are rational. Thus S = PU S’ and Y = X U RS* are 
rational. 7 


and since a thin and complete code is maximal by Theorem [I6.9, the conclusion follows. 


3.2. last 
Note that if X is thin, then Y also is thin (Exercise i S): I hus, any thin code with 
deciphering delay d is contained in a maximal one with the same delay. 


EXAMPLE 1.7 The finite code X = {a,ab} has delay 1. We have P = {ba,bb}. The 


admissible pairs are those of the form (z,bb) and (xz,ba) with z € € U X*ab. The 
optimal ones are those with x € X*ab. Thus S = P and R = {ab?,ab’a}. Finally 
Y = {a,ab}U{ab3, ab?a}{bb, ba}* is a complete code with dec} hering delay | containing 
X. An automaton recognizing Y* is represented on Figure FF it 

Observe that there is a much simpler complete code with delay 1 conatining X, 
namely Y’ = ab*. We will see a method allowing to obtain this solution in the next 
section. 


2 Weakly prefix codes 


There is another definition, close to the previous one where one counts the delay in 
letters instead of words of the code. A set X C A* is said to be weakly prefix if there 
exists an integer d > 0 such that the following condition holds for any words x, x’ € X 
and any words y, y' E€ X* with |y| > d: if the prefix of length |x| + d of xy is a prefix 
of x'y', then x = z’. If this holds, we also say that X has literal deciphering delay d. 
The least integer d such that the implication above holds is called the minimal literal 
deciphering delay. Clearly, a weak ’ prefix set is a code. The proof is the same as 
for the verbal delay (Proposition a In the same way, a code has literal deciphering 


delay d if and only if all right comp etable wards of length d are simplifying. The proof 


is analogous to that of Proposition |[-3. 
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Figure 3.8: 


Let d be the minimal verbal deciphering delay and e be the literal one. Then 
d < e < dmax{j|z| | x € X}. 


Thus a finite set has simultaneously finite delay for both notions, but the example 
of X = b U ba*c U ařd shows that both definitions differ when X is infinite. Indeed X 
has verbal deciphering delay 1, but does not have finite literal deciphering delay since 
a” is not simplifying for any n È> 1. 

An automaton A is said to have delay d > 0 if for any pair of paths 


if |z| = d then q = q'. An automaton with finite delay is also called weakly deterministic. 
Thus a deterministic automaton has delay 0. Observe that if A has delay d, then for 
any word w, and for any pair of paths 


with |z| = d, the paths p 4 q and p & q' are equal. 


PROPOSITION 2.1 A strongly connected weakly deterministic automaton is unambigu- 
ous. 


Proof Indeed, if there are two distinct paths from p to q with the same label, then 
we may suppose that the initial edges are distinct. Since the automaton is strongly 
connected, the paths may be chosen as long as desired. Thus the automaton does not 
have finite delay. 7 


The following result proves that a code X is weakly prefix if and only if it is recognized 
by some weakly deterministic automaton A = (Q,1,1). 


PROPOSITION 2.2 Let X be a code and A = (Q,1,1) be an automaton with delay d 
recognizing X*. Then X has literal deciphering delay d. Conversely, if X has finite 
literal deciphering delay, the automaton can be chosen to have the same delay as X. 


Proof Let us first suppose that X* is recognized by A = (Q,1,1) with delay d. We 
show that X has delay d. Let x, x’ € X, let y € A* with |y| = d be a prefix of a word of 
X* and let u € A* be such that ryu = x'y' with y’ € X*. Since y is a prefix of a word 


in X*, there is a path s : 1 p. We also have two paths c: 1 1 and dc :1 5151. 
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Since A has delay d = |y| and since the label of the path cs is a prefix of the label of 
the path c’, the path c’ starts with the path c. Thus, since X is a code, x = z’. 

Conversely, let X have literal delay d and let A = (Q,i,T) be a trim deterministic 
automaton recognizing X and gt, A" = (QUw,w,w) be the automaton obtained by the 
construction of Section “i We show that A* has delay d. Arguing by contradiction, 
assume that 


a z a I1 Z } 
P>q>r, P>q >r 


with |z| = d and q # q’. Then, by construction of A* one of p,q,q is w. Clearly 
p #w. Let for example q = w. Since A* is trim, there is a path w => p and we may 
suppose tha this path does not pass by state w inbetween. We also have a path r’ > w 
(see Figure ie Then wa € X and wazv € X*. Let x = wa and let wazv = x'y' 
with z’ € X and y’ € X*. Since X has literal deciphering delay d, we have x = 2’. 


Consequently y = zv. Thus there are in A* the paths w > q' ++ w and w Sw Sw. 


Since A* is unambiguous, this implies g’ = w, a contradiction. Thus A* has delay 
d. m 


Figure 3.9: 


We may observe that the automaton A* above can be used to check whether a code 
is weakly prefix, and to compute its minimal literal deciphering delay. 

We now turn to maximal weakly prefix codes. The following result is the counterpart 
of Proposition [L6] 


PROPOSITION 2.3 Let X be a maximal code with literal deciphering delay d. Then any 
right completable word u € A* of length d is strongly right completable. 


e%06 
Proof Let v € A*. By Proog Eate exists a word w € A* such that uvw is 
Y 


unbordered. By Proposition ere exist x E€ X* and t € A* such that cuvwt € X*. 
Since X has literal deciphering delay d, and since the word u is right completable, this 
word is simplifying. Thus uvwt € X*, showing that uv is right completable. n 


An automaton A is said to be weakly complete or d-complete if for any path p & q 
with |w| = d, there is a path p & q' for each letter a € A. Observe that this path is 
not required to start with the path p + q. 

If A is d-complete, then by induction for any path p S q with |w| = d, and for any 
word z, there is a path p S g'. 
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PROPOSITION 2.4 Let X be a thin code with literal deciphering delay d and let A = 
(Q,1,1) be a trim automaton with delay d recognizing X*. The code X is complete if 
and only if A is d-complete. 


Proof Suppose first that X is complete. Let p  q be a path in A with |w| = d and 
let a € A be a letter. Since A is trim, there js path 1 p. SERR thin and 
, 


complete, it is a maximal code by Theorem 6.9% By Proposition e word uwa is 
right completable. Thus there exists a path_ = m q’. Since A has delay d and 
since |w| = d, we have p = p’ (see Figure . This shows that A is d-complete. 
Conversely, let x € X* be of length at least d. Then, for any w € A*, since A is 
d-complete, there is a path 1 “4 p. This implies that X is complete since A is trim. a 


Figure 3.10: Showing that A is d-complete. 


.8.4 

We can use the previous result to give another proof of Theorem ET Tet X bea 
finite maximal code. We argue by contradiction and suppose that its verbal delay is 
strictly positive. Since X finite, its literal delay d is also finite and strictly positive. By 
Proposition ere exists a finite d-complete automaton A = (Q,1,1) with delay d 
recognizing X*. 

We first show that we may suppose the automaton unfolded in the sense that all 
states in A except the initial state 1 have indegree 1. This property can be obtained 
by applying the following state splitting method: Let q # 1 be a state with indegree 
r > 1. This state is split into r copies, each of which with indegree 1 and with the same 
outgoing edges. Since X is finite, all cycles in A contain state 1. Consequently, the 
state splitting can be repeated only a finite number of times. Clearly, state splitting 
preserves the delay and d-completeness. 

Assume now that A is unfolded and has the minimal possible number of states. Since 
A has delay d, there is a state q such that there are edges (q,a,r) and (q,a,r’) with 
r Æ r’ and paths labeled v € A‘! going out of r,r’. Let us prove that r,r’ Æ 1. 
Arguing by contradiction, suppose that r’ = 1. Let u be a word of maximal length such 
that there is a path r S 1, decomposing as r > s Š 1 with a simple path s > 1. Let 
b be the first letter of uv. Observe that uv is nonempty since otherwise r = r’. Note 
that no path exists labeled vb and going out of 1, since A has delay d (otherwise, we 
would have two paths labeled avb starting from q with different initial edges). Consider 
now the last letter c of vu and the state t such that (t,c,1) is the last edge of the path 
r75 1. Since A is d-complete, there exists a path beled cub going out of state t. Let 
(t,c,t’) be the first edge of this path (see Figure Peeve corresponds to the case 
u # 1 and where u = u'c). We have t # 1 since there is no path labeled vb going out of 
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Figure 3.11: Showing that r’ Æ 1. 


w 


1. Let w Æ 1 be a word such that there is a simple path t’ > 1. Then there is a simple 
path s + 1 This establishes the contradiction since uw is strictly longer than u. 

Let A’ = (Q',1,1) be the automaton obtained by merging r and r’. Since r,r’ Æ 1 
and since they both have indegree 1 and the same label on the incoming edge, the 
automaton A’ also recognizes X* and is unfolded. Since it has strictly less states than 
A, we obtain the final contradiction. 

We now prove the following result. 


THEOREM 2.5 Each weakly prefix rational code can be embedded into a maximal one 
with the same delay. 


g p F orem-RCFDD A 
This theorem is a variant of, Theorem i zh The proof uses automata and it is illus- 
R sautomataComplete 
trated in Example BS 

We shall use the following lemma. In the proof, we use the notation q > to denote 


some path starting in state q, and labeled with the word u. 


-automataComplete| LEMMA 2.6 Let A = (Q,1,1) be a trim automaton with delay d. One can obtain, by 
adding finitely many states and edges to A, a trim automaton B = (Q’,1,1) which has 
still delay d and which is d-complete. 


Proof. In the case d = 0 we simply add in B all edges (q,a,1), for q # 1 anda E€ A 
when there is no edge leaving g and labeled a in A. 

The proof for d > 1 consists in several steps. 
1. We start with the definition of a new automaton By. We add the set Q’ of states 
denoted q(w), for w € A*, with 1 < |w| < d, and the edges: 

(i) q(w) S q(w’), for w = aw’, a € A, 

(ii) g(a) 4 1 for a € A. 
Denote by Bo = (QUQ’, 1, 1) this new automaton. Clearly, Bo also has delay d. Remark, 
for future use in the final step below, that each state of Q’ is coaccessible, since for 
each q(w), we have a path q(w) 4 1. 


It will be convenient to call future of a state q the set of words w of length < d such 
that there exists some path q >. Note that in Bo, the future of a state q(w) with 
|w| = d is the set of prefixes of w. 

2. We construct now a sequence of automata B,,Bo,... which all have the same 
states as Bo. It will be clear that this sequence is finite. We will show that all B; have 
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delay d. Let Bn be its last element. This will be shown to be d-complete. If B; is 
constructed and is not one, then for aur word u € Af, some letter b and some 


state q of B;, a path q > exists, but no path q Then, writing ub = aw, with 2 € Aas : 


we add to B; the edge q & q(w j, and this gives the automaton 6;+1 (see Figure 


Figure 3.12: The new edge in 6,4, (with ub = aw and the dashes indicating a non 
existing edge)) 


3. We now show a technical property: for each 7 > 0 and for each state p, the future 
of p in B; is equal to the future of p in By. This implies that for any word m € Af, the 
future in every B; of q(m) is the set of prefixes of m. 

It suffice to prove that if there is a path p > in B;+1, with |v| < d, then there exists 
already a path p > in Bi. 

For this, we may suppose that the path p > in Bj; involves the new edge q & q(w) 
created in step 2, where u is such that ub = aw, and q =. Thus, we may suppose that 
this path has the form p S q S q(w) Z p' with v = v,av2, where the last segment 


q(w) Z p' is in Bi. Now |v2| < d, thus the induction hypothesis on the future of q(w) 
implies that və is a proper prefix of w; thus, by construction of the new edge, there 
exists in B; a path q “?, since av is a prefix of u. Hence, we get in B;,1 a path p > 
with a smaller number of occurrences of the new edge. Consequently, a path p > exists 
in B;41, with no occurence of the new edge, and this path is therefore in 6;, proving 
the induction step. 

4. Suppose that B; has delay d. We prove that B;,; has the same delay. Suppose 
that for some states p, p1, p2, some letter c and some word v € Af, one has in B;+1 the 
two paths p > pı > and p È po > Because of (i), some paths pı 2 and po > exist 
in B;. If the edges p % p; and p Ġ py are in Bi, then Pi = P2 because B; has delay d. 
Otherwise, pı Æ p2, and exactly one of the two edges p Ġ pı or p È po, say p © pı, is 
the new edge q & q(w) and the other is in B;. Then p = q, c€ = a, pı = q(w), so that 
v = w by (ii) because v has length d. Thus, considering the other edge (which is in 8;), 
we see that there exists a path q “ in B;. This contradicts the assumption that led to 
the construction in step 2. 

5. Let B’ = (QU Q”,1,1) be the trimmed part of B = (QU Q’, 1,1). It has still delay 
d and we show that it is still d-complete. Assume there is a path p = in B’, and let a 
be a letter. Since B is d-complete, there is a path p & in B. Since p is accessible, each 
state on this path is accessible. Since all states in Q’ are coaccessible, all states on the 
path are both accessible and coaccessible. Thus this path is in B’. This completes the 
proof. n 


Proof of the theorem. Let X be a nonempty rational code with literal deciphering 


fig-newEdge 


a automaton 
delay d. By Proposition po there exists an unambiguous automaton A = (Q, 1,1) ‘ie Pre ee 


same delay d which recognizes X*. We may suppose that A is trim. By Lemma 
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we may embed A into a trim automaton B = (Q’,1,1) which has delay d and which is 


d-complete. 

Since $ is a strongl connected automaton with finite delay, it is unambiguous, as 
stated in Proposition aie set recognized by B’ is of the form Y*, for some ean 
rational code Y containing X. Mor over, K pas deciphering delay d, by Proposition E 
and it is complete by Proposition ppa Y is a maximal rational code with deci- 


phering delay d containing X. 


. bis.1.last o. -automaton 
’>x-automataSimple| EXAMPLE 2.1 Let X = {a,ab} as in ample - Using Proposition 2-4, we obiam 
=autre is.2.4 
| Ap O 


the automaton on the left of Figure plying the method. of Theorem 
this automaton we obtain the automaton on the right of Figure f is gives the 
complete code Y = ab* containing X. 


a 


Figure 3.13: Completion of X = {a, ab}. 


. i ig=automataDelay1 
-automataComplete| EXAMPLE 2.2 Let A be the automaton represented in Figure on the left. It has 
delay 2 and recognizes {a,aab}* which is a code with literal eciphering delay 2. 
The automaton Bo is represented in Figure on the right (we denote the new states 
A 7 SuN ; g =automatonDelay2 
w instead of q(w) for simplicity). The final automaton BG is represented on Figure f tsj 


after removal of the states which are not accessible. 


Figure 3.14: The automata A and Bo fig-automataDelay1 


3 Exercises 


. section2bis.1 
Section 


1.1 Show that the deciphering delay of a code X is infinite and only if there is an 
infinite path in Gx starting in a vertex in X. If X is finite, this happens if and only if 
there is a cycle in Gx that is accessible from some vertex in X. 
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Figure 3.15: The automaton B 


1.2 (i) Show that a code X has deciphering delay d if any disjoint factorizations 
L1°°+XLnp = Y1 Ym, Where z1,..., En, Y1,- -Ym are words in X and p is a prefix of a 
word in X, satisfy n < d. 

(ii) Let e,---e, be the sequence of edges of a path from x € X to t in the prefix 
graph of a code X. The occurrence e; is called even (odd) if the number of crossing 
edges among €1,...,¢€; is even (odd). Show that in the double factorization ry; --+ yet = 
T1’: Lp OY TYI: Ye = L1: Lkt corresponding to this path, Z is the number of even 
edges and k is the number of odd edges. 

(iii) Describe a linear time algorithm for computing the deciphering delay, assuming 
that there is no cycle in the prefix graph. 


1.3 Let Y and Z be composable codes with finite deciphering delay d(Y) and d(Z). 
Show that X = Y o Z has finite delay d(X) < d(Y)+d(Z). (Hint: Show that for 
ye XU), z e XU), we have yz € S(X).) 


1.4 Let X = {x,y} be a two-element code. Show that X has finite deciphering delay. 


1.5 Let X C A* be a finite code. 


(a) Show that there exists a smallest submonoid M containing X* such that M 
is generated by a code with finite deciphering delay. 
(b) Let Y C A* be the base of the submonoid e sis" is asserted in (a). 


Show by a proof analogous to that of Proposition [I at 
Pex) ne) x 
Deduce from this that if X does not have finite deciphering delay, 


Card(Y) < Card(X) — 1. 


1.6 Show that a code X has verbal deciphering delay at most d if and only if the code 
X” has verbal deciphering delay at most 1. 


1.7 Let X C At be a code. 
Show that if both the sets E(X) of strongly right completable words and S(X) of 
simplifying words are nonempty, then they are equal. 
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tion is thin. 


. section2bis.2 
Section 


2.1 We say that an automaton has bidelay (d,d’) if it has left delay d and right delay 
d'. In the same way, we say that an automaton is (d, d’)-complete if it is left d-complete 
and right d’-complete. We introduce a new concept to work with automata with finite 
bidelay. 

An extended automaton with delay (d, d’) is an automaton on a set of states Q where 
the set E of edges, in addition to ordinary edges, includes boundary edges. A forward 
boundary edge has an origin g € Q and a label a € A but no end. A backward boundary 
edge has a label a € A and an end q € Q but no origin. We extend the notion of a path 
by admitting that a path may possibly begin with a backward boundary edge and end 
with a forward boundary edge. We denote by F'(p) the set of edges starting at p and 
by P(p) the set of edges ending at p. We denote by X(e) the label of the edge e. 

Each state q of an extended automaton has attached to it a pair (Uq, Vq) where Ug 
is a set of words of length d and V, is a set of words of length d’. Similarly, each edge 
e has such a pair (Uc, Ve) C A? x A“. These are subject to the following compatibility 
conditions. 


1.8 ao that if X is a thin code with delay d, then the code Y defined by Equa- 


1. For each state p the family of sets A(e)Ve for e € F(p) forms a partition of the 
set V,A. 
2. For each state p and each edge e € F(p), Up = Ue. 
3. For each state q, the family of sets UeA(e) for e € P(q), forms a partition of the 
set AU. 
4. For each state q and each edge e € P(q), Vz = Ve. 
Show that the two following objects coincide: 
(i) an extended automaton with delay (d,d’) without boundary edges. 
(ii) a (d,d’)-complete automaton with bidelay (d,d’) with Up (resp. Vp) equal for 
each state p to the set of labels of paths of length d (resp. d’) ending at p (resp. 
starting at p). 


2.2 Define, for a state p of an extended automaton, the noncommutative polynomial 
(p) = UpVpA — AUpVp, 


and for an edge e 
O(e) = EUe Ale) Ve 


with £ = 1 if e is a forward boundary edge, € = —1 if e is a backward boundary edge, 
and £ = 0 otherwise. Show that 


YA) = Fae). 


pEQ ecE 


Derive that the sum of O(e) for all boundary edges, called the balance of the automaton, 
belongs to the lattice £ generated by the polynomials fw = wA — Aw for w € ATH, 
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2.3 Show that the following labeled graphs satisfy the definition of an extended au- 
tomaton. 
1. The automaton Apo with set of states Q = Add with Uy, = u and Vaw = v for 
ae Af vE AŤ. The set of edges is AITH with Unay = u, A(uav) = a and 
Vuav = v. Moreover, F (uv) = uvA and P(uv) = Awv. 
2. The automaton A_, obtained from Ao by deleting the single state x. Show that 


in A-z, 
To=- 
e€E 
3. The automaton A, obtained from Ag by deleting all edges except those incident 
to state x. Show that in Az, 


5 a(e) = 


ecE 


2.4 An edge e of an extended automaton is said to be simple if Ue and Ve have 
just one element. Show that, by adding finitely many states and edges, any extended 
automaton can be transformed in such a way that all boundary edges are simple. 


2.5 Show that any extended automaton can be embedded into one having no bound- 
ary edge. 
hint: First assume that all boundary edges are simple. Write $ ecg 3le) = do bz fz 
where the coefficients by are integers. If by > 0 add by copies of A_,, and if by < 0, 
add by copies of As. The resulting extended automaton is such that $` 0(e) = 0. Fi- 
nally merge each forward boundary edge e with a backward boundary edge e’ such that 


A(e) + Ae’) =0 


2.6 The aim of this exercise is to show that any rational code with finite literal delay 
in both directions is included in a maximal one. 

Let A = (Q,1,1) be an automaton with bidelay (d,d’). We use a series of steps to 
transform A into an automaton with the same bidelay which is (d,d')-complete. Show 
that if A is an automaton with bidelay (d,d’), one may first define the pairs (Ug, Vg) 


and then add boundary edges to 0 gbtain an extended automaton. 
Conclude, using Exercise Baa that any code with literal bidelay (d, d’) can be embed- 
ded into one with the same literal bidelay. 
tors endodin ona.on 
2.7 o the automaton with bidelay (1,1) of Figur Ayh the leit. Show 
the (1, E nda maton constructed as in Exercise e one A sonteal in in 
: tende yomaton, 
i e right. 
4 Notes 


The notion of deciphering delay appears at the very beginni g of the theory of codes 


eae and Moord ((1959); (L964) ). oo $ ue to|Schiitzenberger 
(1966). It was conjectured in Gilbert and Moore (L959 . An incomplete proof : appears 


in I Markov Markov] (1962). A proof of a result which is more m than Theorem as been 
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Figure 3.16: Automata with bidelay (1, 1) 


given in Bchützenberger ((1966)). A characterization of the coding morphism for a code 
with finite deciphering delay js given in (L979). 
The proof of Theorem [l.f presented here is due to Véronique Bruyère (see 
(L992) or Chapter 6 of) a (2002) ). 
theorem-RCFDÐ i 
REEK due to |Bruyère et al.| (1990). We have followed their proof except for 


Proposition |[.18} 

Concerning rational relations, we refer the reader to (L979) and 
for relevant results. 

The notion of automaton with finite delay is known in early automata theory as 
information lossless of finite order (see ({L978)). It is related with the notion 
of a right closing map in symbolic dynamics (see (1995). The term 


was introduced by Kitchens (L981). Theorem ran is due to Véronique Bruyére 
(L992). lemna-automataÃonplets 
Psnley et aly 


The construction of Lemma 2 ZD 1s from 


(1993). We have followed t 


presentatio of B Bruyère and Latteux (1996| 
a ka 1S trom Berstel et al] Berstel et al] (1979). An analogous result is proved in 
T To is from IN 1966). 
exg-autbnat airptionin retten 
Ashley et al] ({1993), in which extended automata are 
introduced and called molecules . This name is used metaphorically and refers to the 


(L981) lexo-ExtendahJ 
Exercise imi is from (L966). 
Exercises Z.T O 2.0 are from 
possibility to use the boundary edges as bindings. 
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Chapter IV 


Bifix Codes 


0 Introduction 


The object of this chapter is to describe the structure of maximal bifix codes. This 
family of codes has quite remarkable properties and can be described in a rather satis- 
factory manner. 

As in the rest of this book, we will work here within the family of thin codes. As 
we will see, this family contains all the usual examples, and most of the fundamental 
properties extend to this family when they hold in the simple (i.e., finite or recognizable) 
case. 

To each thin maximal bifix code, two basic parameters will be associated: its degree terá 
and its kernel. The degree is a positive integer which is, as we will see in Chapter P 
the degree of a permutation group associated with the code. The kernel is the set of 
code words which are proper factors of some code word. We shall prove that these two 
parameters characterize a thin maximal bifix code. 

In the first section, we introduce the notion of a parse of a word with respect to a 
bifix code. It allows us to define an integer-valued function called the indicator of a 
bifix code. This function will be quite useful in the sequel. 

In the second section, we give a series of equivalent conditions for a thin code to 
be maximal bifix. The fact that thin maximal bifix codes are extremal objects is 
reflected in the observation that a subset of their properties suffices to characterize 
them completely. We also give a transformation (called internal transformation) which 
preserves the family of maximal bifix codes. 


187 


ex3.1.0 


ex3.1.0bis 


ex3.1.0ter 


ex3.1.0quater 
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ection3.3 

Section B contams the definition of the degree of a thin maximal bifix code. It is 
defined as the number of interpretations of a word which is not a factor of a code word. 
This number is independent of the word chosen. This fact will be used to prove most 
of the fundamental properties of bifix codes. We will prove that the degree is invariant 
under internal transformation. 

In the fourth section, a construction of the thin maximal bifix code having a given 
degree and kernel is described. We also describe the derived code of a thin maximal bifix 
code. It is a code whose degree is one less than the degree of the_o igi al code. Both 
constructions are consequences of a fundamental result (Theorem 3) which character- 
izes those sets of words which can be completed in a finite maximal bifix code without 
modification of the kernel. 

Section His devoted to the study of finite maximal bifix codes. It is shown that for a 
fixed degree and a fixed size of the alphabet, there exists only a finite number of such 
codes. Further it is proved that, on this finite set, the internal transformation acts 
transitively. 


1 Basic properties 


A bifiz code is a subset X of A* which is both prefix and suffix. In other words, we 
have 
XATNX =9, ATXNX =b. (4.1.1) 


EXAMPLE 1.1 Any code X composed of words of the same length is bifix. 


EXAMPLE 1.2 Let A be an alphabet containing two distinct letters a,b. Any set X = 
aU bYb with y C (A \ b)* is bifix. 


EXAMPLE 1.3 If X,Y are bifix codes, then XY is a bifix code. 


EXAMPLE 1.4 Let A = {a,b}. By inspection, the set 
X = {a, aba, a7b?, ab, ba”, baba, bab? , ba, b?} 
appears to be a bifix code. It will appear at several places later. 


1 
An X-parse (or simply a parse) of a word w € A* is a triple (v, x,u) (see Fig. ery 
such that w = vzu and 


v € A“ — A*X, re xX", u € A*—XA*. 
An interpretation of w € A* is a triple (v, x,u) such that w = vru and 
vEA X, re xX", u E€ XA. 


Since the code X is bifix, we have ATX C A*— A*X, and XA~ C A*— X A*, thus any 
interpretation of w is also a parse of w. 

A pointin a word w € A* is a pair (r,s) € A* x A* such that w = rs. A word w 
thus has |w| + 1 points. A parse (v, x,u) of w is said to pass through pero (r,s) 
provided x = yz for some y, z E€ X* such that r = vy, s = zu (see Fig. 
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aan a 
wW o \ O 
V X u 


Figure 4.1: An X-parse of w. 3_01 
A a 
o— % U ọ 
| | l 
| | l 
ò w $ ò 
Figure 4.2: A parse of w passing through the point (r, s). 3_02 


PROPOSITION 1.1 Let X C A® be a bifix code. For each point of a word w € A*, there 
is one and only one parse passing through this point. 


Proof. Let (r,s) be a point of w € A*. The code X being prefix, there js, a unique 
z € X*, and a unique u € A* — XA* such that s = zu (Theorem [II1-4). Since X is 


suffix, we have r = vy for a unique v € A*—A*X and a unique y € X*. Clearly (v, yz, u) 
is a parse of w passing through (r,s). The uniqueness follows from the uniqueness of 
the factorizations of s and r. n 


PROPOSITION 1.2 Let X C A? be a bifix code. For any w € A*, there are bijections 
between the following sets: 
1. the set of parses of w, 
2. the set of prefixes of w which have no suffix in X, 
3. the set of suffixes of w which have no prefix in X. 


Proof. Set V = A* — A*X, U = A* — XA*. For each parse (v,x,u) of w, the word 
v is in V and is a prefix of w. Thus v is in the set described in 2. Conversely, if 
w = vw and v € V, set w = xu with x € X* and u € U (this is possible since X is 
prefix). Then (v, x,u) is a parse. The uniqueness of the factorization w’ = xu shows 
that the mapping (v, x,u) +> u is a bijection from the set of parses on the set described 
in 3. 7 
Let X be a bifix code over A. The indicator of X is the formal power series Lx (or 
simply L) which associates to any word w the number (L, w) of X-parses of w. 
Setting U = A* — X A*, V = A* — A* X, we have the result 


L=VxX*U (4.1.2) 


Note that X A* = XA* since X is prefix, and A*X = A*X since X is ie Thus 


, we 


U=A*—-XA* =(1-X)A* andV = A*(1— X). Substituting this in ( 


obtain 
L= A*(1- X)A*. (4.1.3) 
L=V A*=A*U. (4.1.4) 


.1.2 
Note that this is a compact formulation of Proposition i “xt 


This can also be written as 
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33 
From formula (ETS we obtain a convenient expression for the number of parses of 
a word w € A*: 


(L,w) = [w| +1 —(A*X A*,w). (4.1.5) 

The term (A*X A*,w) equals the number of occurrences of words in X as factors of 

w. Thus we see from that for any bifix codes X,Y the following implication 
holds: 

Y C X => Lx < Ly (4.1.6) 


(Recall that the notation Lx < Ly means that (Lx,w) < (Ly, w) for all w in A*). 


PROPOSITION 1.3 Let X C At be a bifiz code, let U = A* — X A*, V = A* — A*X, 
and let L be the indicator of X. Then 


1-X =(1-A)L(1- A). (4.1.8) 


T 4 8 
rae Formula (ft follows from Ei and is an immediate consequence of 


PROPOSITION 1.4 Let X C At be a bifix code and let L be its indicator. Then for all 
w E€ A* 
0< (L,w)-1< Ju. (4.1.9) 


In particular, (L,1) = 1. Further, for all u,v,w € A*, 
(L,v) < (L, uvw). (4.1.10) 


Proof. For a given word w, there are at most |w| + 1 and at Jeast one (namely, the 
empty a gs of w which have no suffix in X. Thus ( Fis a consequence of 


Proposition [CZ 
Next any parse of u can be extended to a se of uvw. Thi rse of uvw is uniquely 
determined by the parse of v (Proposition ite ihis shows Ee n 


EXAMPLE 1.5 The indicator L of the bifix code X = 0) satisfies (L, w) = |w| + 1 for all 
we A*. 


EXAMPLE 1.6 For the bifix code X = A, the indicator has value (L, w) = 1 for all 
we A*. 


The following proposition gives a characterization of formal power series which are 
indicators. 


PROPOSITION 1.5 A formal power series L € Z((A)) is the indicator of a bifix code iff 
it satisfies the following conditions: 
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(i) For alla € A,w € A*, 


0 < (L,aw) — (L,w) < 1, (4.1.11) 
0 < (L, wa) — (L, w) < 1. (4.1.12) 


(ii) For alla,b € A and w € A*, 


(L, aw) + (L, wb) > (L, w) + (L, awb). (4.1.13) 


(iii) (L,1)=1. 


Proof, Assume that L is the indicator of some bifix code X. It follows from formula 
(era that the coefficients of the series L(1— A) and (1 — A)L are 0 or 1. For a word, 
(eta 


w € A* and a letter a € A, we have (L(1 — A), wa) = q) — (L, w). Thus, 
holds. The same holds for ae Finally, formula ( gives for the empty word, 
the equality (L,1) = 1, and for a,b € A, w € A*, 

—(X_, awb) = (L, awb) a (L, aw) B (L, wb) T (L, w), 


13 
showing (fri 


Conversely, assume that L satisfies the three conditions. Set S = (1 — A)L. Then 
(S,1) = (£,1) = 1. Next for a € A, w € A*, we have 


(S,aw) = (L,aw) — (L, w). 


11 
By EnD 0 < (S,aw) < 1, showing that S is the characteristic i g some set U 


containing the empty word 1. Next, if a,b € A, w € A*, then by ( 


(S,aw) = (L, aw) — (L,w) > (L, awb) — (L, wb) = (S, awb). 


Thus, awb € U implies a pwing that U is prefix-closed. 

According to Theorem ie ze set X = UA — U is a prefix code and 1 — X = 
U(1— A). 

Symmetrically, the series T = L(1— A) is the characteristic series of some nonempty 
suffix-closed set V, the set Y = AV — V is a suffix code and1—Y =(1—A)V. 

Finally 


t= X =00=A} 


II 
= 
| 
D 
= 
| 
D 
|l 


(Ay =1-F. 


Thus, X = Y and X is bifix with indicator L. n 


2 Maximal bifix codes 


A code X C A* is a maximal bifix code if, for any bifix code Y C A*, the inclusion 
X C Y implies that X = Y. As in the preceding chapter, it is convenient to note that 
the set {1} is a maximal bifix set without being a code. We start by giving a series of 
equivalent conditions for a thin code to be maximal bifix. 
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PROPOSITION 2.1 Let X be a thin subset of At. The following conditions are equiva- 
lent: 

(i) X is a maximal code and bifix. 

(ii) X is a maximal bifix code. 

(iii) X is a maximal prefix code and a maximal suffix code. 

(iv) X is a prefix left complete code. 

iv’) X is a suffix right complete code. 

(v) X is a left complete and right complete code. 


Proof. (i) = (ii) is clear. (ii) = (iii). If X is maximal prefix, then by Theorem HET 
is a maximal code, therefore X is maximal suffix. Similarly, if X is maximal suffix, it 
is maximal prefix. Thus, assume that X is neither maximal prefix nor maximal suffix. 
Let y,z E€ X be such that X Uy is prefix and X U z is suffix. Since X U yt is prefix for 
any word t, it follows that X U yz is prefix, and so also bifix. Moreover, yz ¢ X (since 


otherwise X U y would not be prefix). This es (ii). 

(iii) > (iv’) is a consequence of Theorem stating that a maximal prefix code is 
right-complete (similarly for the implication (iii) > (iv)). 

(iv) = (v) The code X is complete and thin. Thus, it is maximal. This shows that 
it is maximal prefix, which in turn implies that it is right c let 7 

(v) = (i) A complete, thin code is maximal. By Theorem FRESE complete thin 
code is prefix. Similarly, X is suffix. m 


A code which is both maximal prefix and maximal suffix is always maximal bifix, 
and the converse holds, as we have seen, for thi godes. However, this may become 
false for codes that are not thin (see Example . 


iin . 2 
EXAMPLE 2.1 A group code is a bifix and a maximal code (Section i 


EXAMPLE 2.2 Let A = {a,b} and 
X = {a®, a*ba, a?b?, ab, ba”, baba, bab”, ba, b°}. 


By inspection of the literal representation (Fig. ES. X is seen to be a maximal prefix 
code. ä 

The reverse code X is also maximal prefix (Fig. H. Thus X is a maximal bifix 
code. Observe that X is equal to the set obtained from X by interchanging a and b 
(reflection with respect to th oțizontal axis). This is an exceptional fact, which will 
be explained later (Example f i 


EXAMPLE 2.3 Let A = {a,b} and 
X = {wabl”! | w € A*} 


dr 1 
(see Examples ioana HE i is a maximal, right-dense code which is suffix but 


not prefix. The set 


Y=X-XAt 
is maximal prefix and suffix but not maximal suffix sinc # X. Thus, Y is also 
maximal bifix, satisfying condition (ii) in Proposition ‘without satisfying condition 


(iii). 
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Figure 4.3: The literal representation of X. 


Figure 4.4: The literal representation of X. 


The following result gives a different characterization of maximal bifix codes within 
the family of thin codes. 


PROPOSITION 2.2 A thin code X is maximal bifix iff for all w € A*, there exists an 
integer n > 1 such that w” € X*. 


Proof. Assume that for all w € A*, we have w” in X* for some n > 1. Then i gar 
is right-complete and left-complete. Thus, X is maximal bifix by Proposition B.I| 

Conversely, let X be a maximal bifix code, and let w € A*. Consider a word u € 
F(X), i.e., which is not a factor of a word in X . The code X being right-complete, 
for all i > 1 there exists a word v; such that 


w'uvi E X* 


Since u € F(X), there exists a prefix s; of u such that ws; € X*. 

Let k,m with k < m be two integers such that są = sm. Then setting n = m — k, we 
have wsp € X*, w™ Sm = w'wks, € X*. 

Since X* is left-unitary, this implies that w” € X*. n 

We now describe an operation which makes it possible to construct maximal bifix 
codes by successive transformations. 


PROPOSITION 2.3 Let X be a code which is maximal prefix and maximal suffix, and let 
w E A*. Set 
G=Xwt, Daw eX, 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


3_03 


194 IV. Bifix Codes 


Go =(wD)w!, = Do=w '(Gu) (4.2.1) 


6-66. B= Dd; 
If Gi 40 and Dı #9, then the set 


y= (X U w U Gı(wD%)D1) = (Gw U wD) (4.2.2) 


is a maximal prefix and maximal suffix code. Further, 


Y=X+(1-G)w(l- DoD1) (4.2.3) 


Proof. If Dı 4 Ø, then D is nonempty. Further 1 ¢ D, since otherwise w € X, and 

X being bifix, this implies G = D = {1}, and Do = {1} and finally Dı = i : s 
contradiction. Thus, w is a proper prefix of a word in X , and by Proposition [TI-6, 
the sets D and 


Yı = (X U w) —- wD 


are maximal prefix codes. 
Next, Gw = X N A*w, also Gow = wD N A*w; similarly for D and Do . 


Thus, 
wA* N A*w N X = Gw N wD = wDo = Gow. (4.2.4) 
4 
Now note that G = Go UG}. From this and (Raat we get 


GuUwD = GowUG,wUwD 
= wD) UG wUwD 
= GywUuD, 


since Do C D. Similary 
Gw U wD = GwUwD, 


Thus 
Y= (Yı U GıwD% Dı) — Gw. 


Since D = Dı U Do is a max 
prefix code (Proposition [A 
according to Proposition [M] 
Symmetrically, it may be shown successively that Yə = 7 ae w) — wG and Y’ = 
(Yo -wD,)UG,G)wD, are maximal suffix codes. From ( - we obtain by induction 
that Gow = wD., Thus, Y’=Y and consequently Y is also maximal suffix. 
To prove (SoH 


prefix code and Dı # Ø, the set Dj.D is a maximal 
T his and the fact that Yı is maximal prefix imply, 


set 
o=X +(1-G)w(l- DoD) 


Then 
g=X +w-Gu—-wDj)N1+Gwd D1, 


o=X+w-Gw-wD§D1+GowDiD1+GiwD sD. 
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Since Gow = wD o, we obtain 


o = X+w-Gw—wD§Dit+wDoD§Dit GwDiD1 


>< | 


= +w-Gw-wDı+GıwDDı 
The sets Giw, Do, Dı are prefix, and Do # 1 (since otherwise w € X) us, the 
products in the above expression are unambiguous. Next it follows from ( that 
GiwN wD =Í. 
Consequently 
GwUwD =Giw+uD. 
Thus 


o =X +w+G,wDo D, —- GwUwD =Y. 
E 


The code Y is said to be obtained from X by internal transformation (with respect 
to w). 


RENE 2.4 Let A = {a,b}, and consider the uniform code X = A?. Let w = a. 
G = D = A and Go = Do = {a}. Consequently, the code Y defined by formula 


Y = a U ba*b. 
Note that Y is a group code as is X . 


2 
From formula (i it is clear that for a finite code X , the code Y is finite iff 
Do =. This case deserves particular attention. 


PROPOSITION 2.4 Let X be a finite maximal bifix code and let w E€ A*. Set 
G= Xw, D=wX (4.2.5) 
IfG#0, D#0 and GwNwD =}, then 
Y = (X U wU GwD) — (Gw U wD) (4.2.6) 
is a finite maximal bifiz code, and 
Y =X +(G= lw) -1). (4.2.7) 


Conversely, let Y be a finite maximal bifix code. Let w € Y be a word such that there 
exists a maximal prefix code D, and a maximal suffix code G with 


GwD CY. 


Then 
X = (Y — w — GwD) U (Gw U wD) (4.2.8) 


is a finite maximal bifiz code, and further (Bear EEE and (24 hold. 
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Proof. If Gw N wD = hen we have with the notations of Proposition 2-3, 
Do = Ú by formula . Then (4.2.4) simplifies into ( Formula (#-2-7) is a 


direct consequence of formula (om 
Conversely, let us first show that Xi is a maximal prefix code. Set 


Z= (Y —w)UwD 


. . è re = 2 bd 1 . . . 
Since Y is an by Proposition Bafana since D is maximal prefix and w € Y, 
I 


Corollary mplies that the set Z is a maximal prefix code. Next observe that 
X = (Z — GwD) U Gw. 


The set Q = Gw is contained in ZAT, since Gw C (X — w)A~. Next Q is prefix: 
assume indeed that gw = g'wt for some g,g' € G, t € A*. Let d be a word in D of 
maximal length. 

The set D being maximal prefix, either td € DAT, or td € DA* or finally td € D. 
The two first cases are ruled out by the property of Y to be prefix. Thus, td € D, and 
since d has maximal length, we get t = 1. This proves the claim. 

Further, for all g € G, we have D = (gw)~'Z. Indeed, the inclusion gwD C Z implies 
D C (gw)7!Z, and D bejng a imal prefix code, the equality follows. 

In view of Proposition Fikk T set X consequently is a maximal prefix code. Sym- 
metrically, it may be shown that X is maximal suffix. Since X is finite, it is maximal 
bifix. 

It remains to show that Y is obtaj q from X by internal pee matio First, the 
inclusion Gw C X follows from ( ` implying G C Xw7!, and G being a maximal 
suffix code, this enforces the equality 


G = Xw! 


Symmetrically D = w™tX. Moreover, G 40, D 4 0, because they are maximal codes. 
Let us show that 
GwN wD =f. 


If gw = wd for some g € G, d € D, then ggw = gwd € GwD C Y. Thus w,ggw € Y; 
this is impossible, since Y is suffix. 

From w € Y we get the result that Gw N Y = @; otherwise Y, quld not be suffix. 
Similarly wDNY = 0, because Y, ig prefix. Then as a result of ( |, X —(GwUwD) = 
Y — (w U GwD), implying ERE 


EXAMPLE 2.5 Let A = {a,b} and X = A®. Co sjder the word w = ab. Then G = 
D = A and Gw N wD = 90. Thys Proposition EE a finite code Y. This code is 
obtained by dropping in Fig. he dotted lin and by adjoining the heavy lines. The 
result is the maximal bifix code of Example ag ee 


3 Degree 


In this section, we examine the indicator of thin maximal bifix codes. For these bifix 
codes, some simplifications occur. 
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Figure 4.5: An internal transformation. 3_05 


Let X C At be a bifix code, set U = A* — X A*, V = A*— A*X and let L = V X *U 
be the indicator of X . If X is a maximal prefix code, then U = P where P = X A7~ 
is the set of proper prefixes of words in X . In the same way, for a maximal suffix 
code, we have V = S where S = A` X is the set of proper suffixes of words in X . 
It follows that if X is maximal prefix and maximal suffix, each parse of a word is an 
interpretation. Then we have 


L=SX*P=SA*=A'P. (4.3.1) 


This basic formula will be used frequently. It means that the number of parses of a 
word is equal to the number of its suffixes which are in P, or equivalently the number 
of its prefixes which are in S. Let X be a subset of At. Denote by 


H(X) = AXA = {we A* | ATwAtTNX #0} 
the set of internal factors of words in X . Let 
H(X) = A* — H(X) 


Clearly, each internal factor is a factor of a word in X . The converse may be false. 
The set H(X) and the set 


F(X) = {w € A* | A*wA* OX #0} 
of factors of words in X are related by 
F(X) = H(X)UXA UA XUX 
and for F(X) = A* — F(X), 
AtH(X)At c F(X) c H(X) 


These relations show that H(X) is nonempty iff F(X) is nonempty; thus X is thin iff 
H(X) #90. 


THEOREM 3.1 Let X C A®* be a bifix code. Then X is a thin maximal code iff its 
indicator L is bounded. In this case, 


H(X) = {w € A*|(L,w) = d} (4.3.2) 


where d is defined as d = max{ (L, w) | w € A*}. 
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formula (L,ww’) = (S A*,ww’). Thus the number of parses of ww’ is equal to 
the number of prefixes of ww’ which are in S = ATX. Since w € H(X), it follows that 
no such prefix in S is stri tly longer than w. Thus all these prefixes are prefixes of w. 
Again using formula , this shows that 


Proof. Let 3 : a thin maximal bifix code. Let w € H(X) and w’ € A*. According to 


(L, ww") = (L,w). 
1.4 
Now by Proposition [4 we have (L, ww’) > (L,w’). Thus we get 
(L, w’) < (L,w) 


showing that L is bounded on A* by its value for a word in H(X). This shows also 
that L is constant on H(X). Thus 


H(X) c {w € A* | (L,w) = d} 
To show the converse inclusion, consider an internal factor w € H(X). Then there 
exist p,s € AT such that w’ = pws € X. This implies that, 


(L, w) > (L,w) +1 


Indeed, each parse of w can be extended in a parse of w’, and w’ has an additional 
parse, namely (1, w’,1). This shows that for an internal f oy w, the number (L, w) is 
strictly less than the maximal value d. Thus formula ( "is proved. 
Assume now conversely that X is a bifix code with bounded indicator L, let d = 
a {(4, w) | w € A*} and let v € A* be a word such that (L, v) = d. We use Formula 
E hich can be rewritten as 


XA*=A*+(A-DL. 
Let w € At be any nonempty word, and set w = au, with a € A, u € A*. Then 
(X A*, wo) = (A*+(A — 1)L, auv) = 1 + (L, wv) — (L, auv). 


By Proposition E toth (L,uv) and (L, auv) are greater than or equal to (L, v). By 
the choice of v, we have (L, uv) = (L, aww) = d. 

Thus (X A*, wv) = 1. Thus we have proved that for all w € At, wv € XA*. This 
shows that XA* is right dense. This shows also that X is thin. Indeed, we have 
v € H(X) since for all g,d € At we have gv € XA* and therefore gud ¢ X. Thus X is 
a thin maximal prefix code. Symmetri ally, it can be shown that X is maximal suffix. 
This gives the result by Proposition p i n 

Let X be a thin maximal bifix code, and let L be its indicator. The degree of X, 
noted d(X) or simply d, is the number 


d(X) = max{ (L, w) | w € A*} 


.3.1 
According to Theorem Bat the degree d is the number of parses of any word which is 
not an internal factor of X. Before going on, let us illustrate the notion of degree with 
several examples. 
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EXAMPLE 3.1 Let y be a morphism of A* onto a group G, and let G” be a subgroup 

of G. Let X be the group code for which X* = y!(G’). We have seen that, X A 

maximal bifix code, and that X is thin iff G’ has finite index in G (Beample RS 

The degree of X is equal to the index of G’ in G. Indeed let w € H(X) be a word 

which is not an internal factor of X, and consider the function Y% which associates, 

to each word u € A*, the unique word p € P = XA™ such that ww € X*p. Each p 

obtained in such a way is a proper suffix of w. The set 7(A%*) is the set of suffixes of w 

in P. Since w € H(X), we have Card y(A*) = d(X). Next, we have for u,v € A*, 


y(u) = plv) + G'd(u) = G'dlv). 


Indeed, if y(u) = Yv(v) = p, then uw,vw E€ X*p, and consequently y(u),y(v) € 
G'y(p)p(w)!. Conversely, if G’y(u) = G'yp(v), let r € A* be a word such that 
uwr € X*. Then y(vwr) € G’y(u)y(wr) c G’, whence vwr € X*. Thus one of ¢(u)r 
and ~(v)r is a suffix of the other. Since X is a suffix code, it follows tha i = 

This shows that the index of G’ in G is d(X). By Proposition X) is also 


equal to the degree of the permutation gr YP corresponding to the aeon of G on the 
cosets of G”, as defined in Section 


EXAMPLE 3.2 The only maximal bifix code with degree 1 over A is X = A. 


EXAMPLE 3.3 Any maximal bifix code of degree 2 over an alphabet A has the form 
X = C U BC*B, (4.3.3) 


where A is the disjoint union of B and C, with B £ 0. 

Indeed, let C = AN X and B = A — C. Each b € B has two parses, namely (1, 1, b) 
and (b,1,1). Thus, a word which is an internal factor of a word x € X cannot contain 
a letter in B, since otherwise x would have at least three parses. Thus, the set H of 
internal factors of X satisfies H C C*. Next consider a word x in X. Either it is a 
letter, and then it is in C, or otherwise it has the form x = aub with a,b € A and 
u E€ H C C*. X being bifix, neither a nor b is in C. Thus X C CU BC*B. The 
maximality of X implies the equality. 

This shows that any maximal bifix code of degree 2 is a group code. Indeed, the code 
given by eas. obtained by considering the morphism from A* onto Z/2Z defined 


by (B) = {1}, (C) = {0}. 
EXAMPLE 3.4 Consider the set 
Y = {ab |n > 1}. 


It is a bifix code which is not maximal since Y U ba is bifix. Also Y is thin since 
ba € F(Y). The code Y is not contained in a thin maximal bifix code. Suppose indeed 
that X is a thin maximal bifix code of degree d containing Y. For any n > 0, the word 
a” then has n + 1 parses, since it has n + 1 suffixes which all are proper prefixes of a 
word in Y, whence in X. Since d < n, this ig impossible. In fact, Y is contained in the 
Dyck code over {a,b} (see Example pee 
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EXAMPLE 3.5 Let X,Y C At be two thin maximal bifix codes. Then XY is maximal 
bifix and thin and 
d(xY) =d(xX)+d(Y). 


The first part of the claim follows indeed from Corollary MEN Next, let w € H(XY) 
be a word which is not an internal factor of XY. Then, w € H(X) and w € H(Y). 
The prefixes of w which are also proper suffixes of XY are of two kinds: first there 
are d(Y ) prefixes which are proper suffixes of Y, then there are d(X) prefixes uy with 
u € A` X, y €Y. These are the only prefixes of w which are in A7 (XY). Since w has 
d(XY ) parses with respect to XY, this gives the formula. 


We now define a formal power series associated to a code X and which plays a 
fundamental role in the following. Let X be a thin maximal bifix code over A. The 
tower over X is the formal power series Tx (also written T when no confusion is 
possible) defined by 

(Tx,w)=d-— (Lx,w). (4.3.4) 


The following proposition states some useful elementary facts about the series T. 


PROPOSITION 3.2 Let X be a thin maximal bifixz code of degree d over A, set P = X AT, 
S = AX, and let T be the tower over X. Then 


(T,w)=0 iff we A(X), 


and for w € H(X), 


1<(T,w) <d-1. (4.3.5) 
Further 
(T,1)=d-1 
and 
X-1 = (A-)T(A-1)+d(4-}), (4.3.6) 
P = (A-1)T+d, (4.3.7) 
S = T(iA-1)+d. (4.3.8) 


3,1 _ 
Proof. According to Theorem ETT, w) = 0 iff w € H(X). For all other words, 
1 < (T,w). Also (T, w) < d—1 since all words have at least one parse, and (T,1) = d—1 
since the empty word has exactly one parse. 
Next by definition of T, we have T + L = dA*, whence 


T(-4A)+L(1-A)=d=(1-A)T+(1-A)L 


The code X is maximal; cgngequently P= A* — XA* and S = A* — A*X. Thus we 
can apply Propositjo Td wi ,-P=U,S= Pogether with the equation above, this 


yields formulas i; ( Ñ and also since 


X -1=P(A-1)=((4-1)T pay - 1). 
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3.2 
Proposition fa Stiows that the support of the series T is contained in the set H(X). 
Note that two thin maximal bifix codes X and X’ having the same tower are equal. 
Indeed, by Proposition EA iAy have the same degree since 


Thao Stat ae 


6 
But then Eq. Sd implies that X = X’. 
Whenever some thin maximal bifix code of degree d = d(X) satisfies some equation 


X -1=(4-1)T(A -1)+d(A - 1), 


then T must be the tower on X. The next result gives a sufficient condition to obtain 
the same conclusion without knowing that the integer d is equal to d(X). 


PROPOSITION 3.3 Let T,T’ € Z((A)) and let d,d’ >1 be integers such that 
(4 -1)T(4 -1) +d(A - 1) = (4 — 1)T'(A -1) +4 (4 — 1). (4.3.9) 
If there is a word w € A* such that (T, w) = (T', w), then T = T' and d = d’. 
Proof. After multiplication on both sides by A* = (1 — A), Eq. Es becomes 
T — dA* =T' — d'A*. 


If (T, w) = (T',w), then (dA*, w) = (d'A *,w). Thus, d = d’, which implies T = T”. 
| 


2.3 
We now observe the effect of an internal transformation (Proposition 2) on the 
tower of a thin maximal bifix code X. Recall that, provided w is a word such that G1, 
Dy, are both nonempty, where 


G=Xw", D=w 'X, Go = (wD)w™, Do = w (Gu), 


G=C-4G..  Di=D- Do 


the code Y defined by 
¥ =X +(1-G@)w(1—DsD) 


is maximal bifix. By Proposition MES the sets G = Xw ! and D = w™!X, are 
maximal suffix and maximal prefix. Let U be the set of proper right factors of G, and 
let V be the set of proper prefixes of D. Then DV is the set of proper prefixes of 
words in D5 D1, since D = Do U D1. Consequently 


G -1=(A-1)U 


=~) 


DDı-1=DV (A-1). 
Going back to Y, we get 
Y -1=X-1+(4-1)UwDV (A-1). 
Let T be the tower over X. Then using Eq. (Era we get 
Y -1=(4-1(T+U wD3V)(A -1)+d(A -1) 
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Observe that since X is thin, both G and D are thin. Consequently also U and V are 


thin. Since Dy = D— AIN Do is not a maximal code. As a subset of D, the set Do 
is thin. By Theorem | Do is not complete. Thus Dé is thin. Thus UwD¥V, as a 
product of thin sets, is thin. Next supp(T) C H(X) is thin. Thus supp(T) U UwDV 
is thin. 


Let u be a word which is not a factor of a word in this set. Then 
(T +UwD3V,u) =0 


On the other hand, formula (EEF Shows that since Gj(wD§)D is thin, the set Y is 
thin. Thus, the support of the tower Ty over Y is thin. Let v be such that (Ty,v) = 0, 
then 

(T+UwDoV, uv) = (Ty, uv) = 0, 


5323 
showing that Proposition BS can be applied. Consequently, 
d(x) = d(Y) and Ty =T+UwDoV. 


Thus, the degree of a thin maximal bifix code remains invariant under internal trans- 
formations. 


EXAMPLE 3.6 The finite Bae hifix code X = {a?, aba, a?b?, ab, ba”, baba, bab”, ba, b?} 


over A = {a,b} of Example as degree 3. This can be seen by observing that no 
word ag pore than 3 parses, and the word a? has 3 parses, or also by the fact (Exam- 
ple at X is obtained from the uniform code A? by internal transformation with 
respect to the word w = ab. Thus d(X) = d(A®) = 3. 


In this example, D(= w~! A?) = G(= A®w7!) = A. Thus 
Tx = T43 +w. 


Clearly 
Tas =2+a+0 


Consequently 
Ty =2+a+0+ab. 


We now give a characterization of the formal power series that are the tower over some 
thin maximal bifix code. 


PROPOSITION 3.4 A formal power series T € N((A)) is the tower over some thin 
maximal bifix code iff it satisfies the following conditions. 


(i) for alla € A, v € A*, 


0 < (f,v)—-(T,av) <1, (4.3.10) 
0 < (T,v)—(T,va) <1; (4.3.11) 


(ii) for alla,be A, v € A*, 


(T, av) + (T, vb) < (T,v) + (T, avb); (4.3.12) 
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(iii) there exists a word v € A* such that 
(Tye) =0. 
w Let X be a thin maximal] ] al pifix co, gf degre et, L be its indicator, and let 
Až — h T n, Rigs. EAT , and are direct consequences of 
T ( : , and (#113). Further ( (iii) holds for all v € H(X), and this set 


is nonempty. 


Conversely, assume that T € N((A)) satisfies the conditions of the proposition. Define 


d= (T,1) +1, L = dA* — 
1.5 
Then by construction, L satisfies the conditions of Proposition ; and consequently 
L is the indicator of some bifix code X. Next by assumption, T has nonnegative 
coefficients. Thus for all wE A*, we have (T,w) = d—(L,w) > 0. Thus, L is bounded. 


In view of Theorem e ade X is maximal and thin. Since (T, v) = 0 for at least 
one word v, we have (L,v) = = d and d = max{(L,w)|w € A*}. Thus, d is the degree of 
X and T = dA* — L is the tower over X. n 


The preceding result makes it possible to disassemble the tower over a bifix code. 


PROPOSITION 3.5 Let T be the tower over a thin maximal bifix code X of degree d > 2. 
The series 


T' =T- H(X) 
is the tower of some thin maximal bifix code of degree d — 1 


Proof. First observe that T’ has nonnegative coefficients. Indeed, by Proposition Bot 
(T,w) > 1 iff w € H(X). Consequently (T’,w) > 0 for w € H(X), and (T’,w) = 
(T, w) = 0 otherwise. 

Next, we verify the three conditions of Proposition Baro 

(i) Leta € A, v € A*. Ifav € H(X), then v € H(X). Thus (7’, av) = (T, av) —1 and 
(T’,v) = (T’,av)—1. Therefore the inequality results from the corresponding inequality 
for T. On the other hand, if av ¢ H(X), then (T, au) = igh av) = 0. Consequently 
(T,v) < 1 whence (T’,v) = 0. Thus the inequality bs iat holds for T”. 

(ii) Let a,b € A and v € A*. If avb € H(X), then (T’, w) = (T,w) — 1 for each of the 
four words w = avb, av, vb, and v. Thus, the inequality 


(T’,av) + (T', vb) < (T',v) + (T', avb) 


results, in this case, from the corresponding inequality for T. On the other hand, if 
ub, H(X), then as before (T, av), (T, vb) < 1 and (T’,av) = (T’,vb) = 0. Thus 
olds for T”. 
3.4 
Condition (iii) of Proposition Baa is satisfied clearly for T” since (T’,w) = 0 for 
w € H(X). Thus T’ is the tower over some thin maximal bifix code. Its degree is 
1+ (T",1). Since 1 € H(X), we have (T’,1) =d — 2. This completes the proof. / 


Let X be a thin maximal bifix code of degree d > 2, and let T be the tower over X. 
Let X’ be the thin maximal bifix code with tower T’ = T — H(X ). Then X’ has degree 
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d—1. The code X’ is called the code derived from X. Since for the indicators L and 
L’ of X and X’, we have L = dA* —T and L’ = (d—1)A* — T', it follows that 


L'=L- H(X) (4.3.13) 


We let X‘”) denote the code derived from X‘"~) for d(X) >n +1. 
PROPOSITION 3.6 The tower of a thin maximal bifiz code X of degree d > 2 satisfies 


T = H(X)+H(X') +--+ H(X)), 


2355 
Proof. By induction, we have from Proposition BS 


T= H(X)+H(X')+---+H(XO%)4+T 


where T is the tower over a code of degree 1. This code is the alphabet, and consequently 


T = 0. This proves the result. n 


We now describe the set of proper prefixes and the set of proper suffixes of words of 
the derived code of a thin maximal bifix code. 


PROPOSITION 3.7 Let X C At be a thin maximal bifix code of degree d > 2. Let 
S=A X; P= XA and H = AXA, H=A* =H. 


1. The set SN H is a thin mazimal prefix code. The set H is the set of its proper 
prefixes, i.e., SN H = HA- H. 

2. The set POH is a thin maximal suffix code. The set H is the set of its proper 
suffices, i.e., PO H = AH — H. 

3. The set SN H is the set of proper suffixes of the derived code X’. 

4. The set PAH is the set of proper prefixes of the derived code X’. 


Proof. We first prove 1. Let T bet e,tower over X, and let T’ be the to grover the 
derived code X’. By Proposition B.J, T = T' + H , and by Proposition BS i 


S=T(4=1) 40 


the series T’(A — 1) +d-—1 is, by formula ( . the characteristic series of the set 
S’ = A” X’ of proper suffixes of words of X’. Thus, 


Thus, S = T'(A —1)+d-14+H(A —-1)4+1. : e code X’ has degree d — 1. Thus, 


S=H(A-1)+1+S' and §'=T(A-1)+d-1 


The set H is prefix-closed and nonempty. We show that cgntains no right ideal. r 
Indeed, the set X is a maximal prefix code by Proposition p | By Proposition i ae 
the set P contains no ri j al and consequently H = A` P contains no right ideal. 
Again by Proposition [I e set Y = HA — H is a maximal prefix code, and 
H = Y A`. Thus 

¥ =H(A-1)41. 
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Furt ging also suffix-closed, the set Y is in fact a semaphore code by Proposi- 
tion | We now verify that Y = SN H. 

Assume that y € Y. Then, from the equation S = Y +S’, it follows that y € S. 
Since H = Y A~, we have y ¢ H. Thus y € SN H. Conversely, assume that y € SN H. 
Then y Æ 1, since d > 2 implies that H # Ø and consequently 1 € H. Further, each 
proper prefix of y is in SA~ = A` XA = H, thus is an internal factor of X. In 
particular, considering just the longest proper prefix, we have y € HA. Consequently, 
ye HA- H=Y. 

The second claim is proved in a symmetric way. To show 3, observe that by what we 
proved before, we have 


S=Y+S'. (4.3.14) 


Next S = (SN H)U (SN H) =YU(SNA), since Y = SN H. Moreover, the union 
is disjoint, thus S = Y +S MH. Consequently S’ = SN H. In the same way, we get 
point 4. m 


THEOREM 3.8 Let X be a thin maximal bifix code of degree d. Then the set S of its 
proper suffixes is a disjoint union of d maximal prefix sets. 


Proof. If d = 1, then X = A and the set S = {1} is a maximal prefix set. If d > 2, 
then the set Y = S N H, where H = ATXA” and H = A* — H, is maximal prefix by 
Proposition B./. Further, the set S’ = SN H is the set of proper suffixes of the code 
derived from X. Arguing by induction, the set S$’ is a disjoint union of d — 1 maximal 
prefix sets. Thus S = Y US” is a disjoint union of d maximal prefix sets. n 

It must be noted that the decomposition, in Theore EN of the set S into dis- 
joint maxi al, prefix sets is not unique (see Exercise BP The following corollary to 
Theorem is rather surprising. 


COROLLARY 3.9 Let X C A* be a thin maximal bifix code having degree d. For any 
positive Bernoulli distribution 7 on A*, the average length of X is equal to d. 


Proof. Let 7 be a positive Berngulli, distribution on A*, and let A(X) be the average 
length of X. By Corollary [7.4, the average length A(X) is finite an (X k= T(S), 
where S = A` X is the set of proper suffixes of X. In view of Theorem “we have 


S =Yı+Yə2+ rt yg 


where each Y; is a maximal prefix code. As a set f factors of X, each Y; also is thin. 
Thus 7(Y;) = 1 for i=1,...,d by Theorem MERE asequently, 


MX) =J rY) =d 


i=1 


.3.9 T 
Note that Corollary Bo can also be proved directly by starting with formula Pea 
However, the proof we have given here is t @ most natural one. 
We now prove a converse of Theorem m 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


206 IV. Bifix Codes 


PROPOSITION 3.10 Let X be a thin maximal suffix code. If the set of its proper suffixes 
is a disjoint union of d maximal prefix sets, then X is bifiz, and has degree d. 


Proof. Let S = A~X. By assumption S = Yı +---+Yq, where Yj,...,, Yq are maximal 
prefix sets. Let U; be the set of proper prefixes of Y; . Then A* = Y žU ;, and thus 
(1-—Y,)A* =U ;, whence 

Av =U,+Y 54, 


Summing up these equalities gives 
d 
dA* =U; +SA* 
i=1 


Multiply on the left by A — 1. Then, since (A —1)S = X —1, 


-d= (A - 1U: +(X - 1)A*, 


whence 


From this formula, we derive the fact that X A* is right dense. Indeed, let w € AT, and 
set w = au, with a € A. Each of the sets Y; is maximal prefix. Thus, each Y;A* is right 
dense. There exists a word v such that simultaneously auv € Y;A* for all i € {1,...,d} 
and also uv € Y;A* for all i € {1,...,d} . Thus for each i € {1,..., d} 


((A — 1)U ;, wv) = (AU ;,wv) — (Uj, wv) 
= (Uj,uv) —-(U;,wv) =0-0=0. 


Consequently 
(X A*, wv) = (A*,wv) =1. 


Thus, wv € X A*. Conse igntly X A* is right dense or equivalently, X is right complete. 
In view of Proposition ETH means that X is maximal bifix. 

Let w € H(X) be a word which is not an internal factor of X. Then w ¢ U; for 
1 <i <d. The set Y; being maximal prefix, we have w € Y;A* for 1 < i < d. 
Consequently, w has exactly d prefixes which are suffixes of words in X, one in each 
Y;. Thus X has degree d. n 


6 
EXAMPLE 3.7 Let X b he finite maximal bifix code given in Fig. Fea: The tower T 


over X is given by Fig. by its values on th set, Ys. The derived code X” is the 
maximal bifix code of degree,3 of Examples “all i 


e set S’, or proper su 
ofX’, is indicated in Fig. 4.8. The set S of proper suffixes of X is indicated in Fie BR 
The maximal prefix code Y = SN H is the set of wor indicated in the figure by (©). 
It may be verified by inspection of Figs. Beg BS and hat S = SNH. 
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Figure 4.6: A maximal bifix code of degree 4. 3_06 


Figure 4.7: The tower T over X 3_07 


Figure 4.8: The set S$’ of proper suffixes of X 3_08 


Figure 4.9: The set S of proper suffixes of X 3_09 
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4 Kernel 


Let X C At, and let H = ATXA” be the set of internal factors of X. The kernel of 
X, denoted K(X), or K if no confusion is possible, is the set 


K=XO4H. 


Thus a word is in the kernel if it is both in the code and an internal factor of the code. 
As we will see in this section, the kernel is one of the main characteristics of a maximal 
bifix code. 

We start by showing how the kernel is related to the computation of the indicator. 


PROPOSITION 4.1 Let X C A* be a thin maximal bifix code of degree d and let K be 
the kernel of X. Let Y be a set such that 


KCYCX. 
Then for all w € H(X) UY, 
(Ly, w) = (Lx, w) (4.4.1) 
For all w € A*, 
(Lx,w) = min{d, (Ly, w)}. (4.4.2) 


3 
Proof. By formula (ery we have 


Lx = A*(1-X)A*, Ly = 


Let w € A*, and let F(w) be the set of its factors. For any word x € A*, the number 
(A*zA*,w) is the number of occurrences of x as a factor of w. It is nonzero only if 
x € F(w). Thus 
(A*XA*w)= J, (A*eA*,w) 
xe F(w)nx 

showing that if F(w) nN X = F(w) NY, then (Lx,w) = (Ly,w). Thus, it suffices to 
show that F(w) NX = F(w) NY for all w € H(X)UY. From the inclusion Y C X, 
we get F(w) NY C F(w) NX for all w € A*. If w € H(X), then F(w) c H(X) and 
F(w)0X c K(X). Thus F(w) AX c F(w)NY in this case. If w € Y, then no proper 
prefix or z of w is in X, since X is bifix. Thus F(w)N X = {w}U{ A7 wA NX} Cc 
ie : 4: K(X) CY. Consequently F(w) N X C F(w) NY in this case also. This shows 


low, let w € H(X) be an ae factor of X. (Lx,w) < d by ate 
onsequently, (Lx, w) = = {ly.w) by formula ( T le H(X . Then 
(Lx,w) = d. By formula , (Lx, w) < (Ly, w). This proves (| wags 


Given two power series o and 7, we denote by min{o,7} the series defined by 


(min{o,T},w) = min{(o, w), (T, w)}. 
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THEOREM 4.2 Let X be a thin maximal bifix code with degree d, and let K be its kernel. 
Then 
Lx =min{dA*, Lx}. 


In particular, a thin maximal bifiz code is determined by its degree and its kernel. 


Proof, Jake Y = K(X) in the preceding proposition. Then the formula follows from 
eh? Assume that there are two codes X and X’ of same degree d and K(X) = 
K(X’). Then Lx¢x) = L(x) whence Lx = Lx, which in turn implies X = X’. This 
completes the proof. n 

Clearly, the kernel of a bifix code is itself a bifix code. However, not every bifix 
code is the kernel of some bifix code. We now give a characterization of those bifix 
codes which are a kernel of some thin maximal bifix code. For this, it is convenient to 
introduce a notation: for a subset Y of A+, let 


(Y) = max{(Ly,y)| y € Y}; 


it is a nonegative integer or infinity. By convention, u(0) = 0. 


THEOREM 4.3 A bifix code Y is the kernel of some thin maximal bifix code of degree d 
uff 

(i) Y is not maximal bifiz, 

(ii) w(Y) <d-1. 


Proof. Let X be a thin maximal bifix code of degree d, and let Y = K(X) be its 
kernel. Let us verify conditions (i) and (ii). To verify (i), consider a word z € X 
such that (Lx,x) = u(X); we claim that x ¢ H(X). Thus, « ¢ K(X), showing that 
Y Ç X. Assume the claim is wrong. Then uzv € X for some u,v € A*. Consequently, 
(Lx,uxzv) > 1+ (Lx,2x) since the word wav has the interpretation (1, uxv,1) which 
passes through no point of x. This contradicts the choice, o dy and proves the claim. 
Next, for all y € Y, we have (Lx,y) = (Ly, y) by formula ee ir Since (Lx,y) <d-1 
because y € H(X), condition (ii) is also satisfied. 

Conversely, let Y be a bifix code satisfying conditions (i) and (ii). Let L € N((A)) 
be the formal power series defined for w € A* by 


(L,w) = min{d, (Ly, w)}. 


_ 1.5 
Let us verify that L satisfies the three conditions of Proposition (rat First, let a € A 
and w € A*.Then 

0 < (Ly, aw) — (Ly,w) < 1. 


It follows that if (Ly, w) < d, then (Ly, aw) = (L, aw); on the other hand, if (Ly, w) > 
d, then (L,aw) = (L,w) =d. Thus 


0 < (L,aw) — (£,w) <1. 


The symmetric inequality 
0 < (L,wa) —(L,w) <1 


1.5 
is shown in the same way. Thus the first of the conditions of Proposition i E is satisfied. 
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Next, for a,b € A, w E€ A*,(Ly, aw) + (Ly, wb) > (Ly, w)+ (Ly, awb). Consider first 
the case where (Ly, w) > d. Then (L, aw) = (L, wb) = (L, w) = (L,awb) = d, and the 
inequality 

(L,aw) + (L, wb) > (L, w) + (L, awb) 


is clear. Assume now that (Ly,w) < d. Then (Ly,aw) < d and (Ly,wb) < d. 
Consequently 
(L,aw) + (L,wb) = (Ly,aw) + (Ly, wb) 


(Ly, w) + (Ly, awb) 
(L,w) + (L, awb) 


IV IV 


since L < Ly. This shows the second condition. 
Finally, we have (Ly,1) = 1, whence (LZ, 1) = 1. 
1.5 

Thus, according to Proposition H tie series L is the indicator of some bifix code 
X. Further, L being bounded, the code X is thin and maximal bifix by Theorem B.I} 
By the same argument, the code Y being nonmaximal, the series Ly is unbounded. 
Consequently, max{ (L, w) | w € A*} = d, showing that X has degree d. 

We now prove that Y = X N H(X), i.e., Y is the kernel of X. First, we have the 
inclusion Y X) Indeed, if y € Y, then (L, y) < (Ly,y) < (Y) < d-— 1. Thus, 
by Theorem B.I y € H(X). Next, observe that it suffices to show that X N H(X) = 
Y N H(X); this is equivalent to showing that 


3.1 


(X, w) = (¥,w) 


for all w € H(X). Let us prove this by induction on |w|. Clearly, the equality holds 
for |w| = 0. Next, let w € H(X) -— 1. Then (L,w) < d— 1. Thus, (L, w) = (Ly, w). 
This in turn implies 

(A*X A*,w) = (A*YA*,v). 


But F(w) c H(X). Thus, by the induction hypothesis, (X ,s) = (Y_,s) for all proper 
factors of w. Thus the equation reduces to (X ,w) = (X , w). T 


We now describe the relation between the kernel and the operation of derivation. 


PROPOSITION 4.4 Let X be a thin maximal bifiz code of degree d > 2, an, let H = 
A` XA. Set 


K=XnH Y=HA-HE Z=AH-F 
Then the code X' derived from X is 
X'=KU(YNAZ). (4.4.3) 


Further, 


K=XnX'. (4.4.4) 
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prefixes of words in X. Let S = SN H and P’ = PA H. According to Proposition 
S’ is the set of proper suffixes of words in X’ and similarly for P’. Thus, 


Proof. Let S = A~X and P = X A` be the sets of proper right factors and of RS ay 


X'-1=(A-1)S’=As’- $8". 
From S’ = SN H, we have AS’ = ASM AH, and 
AS’=AS OAH, 
i i®n0.6 
where © denotes the Hadamard product (see Section Dip). US, 


X'-1=(AS ©AH)-8$". 


3.7 ; 
Now observe that, by Proposition Bo the set Z is a maximal suffix code with proper 
suffixes H. Thus, (Z — 1) = (A — 1)H and 


Similarly, from X — 1 = (A —1)S we get 


Substitution gives 


X’-1 = (X -1+S)0(Z-1+H)-S' 
XNZ+S80Z+XNH+S8NH +1-(10H)-(§$ ©1)-S8". 


Indeed, the other terms have the value,0 since neither X nor Z contains the empty 
word. Now Z = PN H (Proposition Ease on XNZ=XNPNH =O. Also by 
definition S’ = SN H and K = X N H. Thus the equation becomes 


X'’-1=SNZ+K —-1. 

sak = 
Finally, note that by Proposition Bay SNAH. Thus, 
SNZ=SNAPAHÄ=YNZ 


and 


3 
showing (EEI Next 


X'=KU(YNANZ), 


XNAX'=(KAX)U(XNAYNAZ). 
Now XNYNZ=XNPOSOH =Í, and KNX = K.Thus, 
XAX' =K 


as claimed. m 
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PROPOSITION 4.5 Let X be a thin maximal bifix code of degree d > 2 and let X’ be the 


derived code. Then 
K(X") C K(X) GX". (4.4.5) 


Proof. First, we show that H(X’) c H(X). Indeed, let w X’). Then we have 
(Tx, w) > 1, where Tx is the tower over X’. By Proposition BEER. w) = (Tx, 

(H(X),w). Thus, (Tx,w) > 1. This in turn implies that w € H(X) by Proposition 
By definition, K(X’) = X'N H(X’). Thus, K(X’) c X'N H(X). By P 


X' = K(X) UT, where T is a set disjoint from a Thus x (jax er 
This shows that K(X’) c K(X). Next, formula ko shows that, (X) C = ' 
Finally, we cannot have the equality K(X) = X’, since by Theorem i K(X 


is not a maximal bifix code. 
f . o .4.5 
The following theorem is a converse of Proposition frst 


THEOREM 4.6 Let X’ be a thin maximal bifix code. For each set Y such that 


K(X) CY GX’ (4.4.6) 
there exists a unique thin maximal bifix code X such that 
(i) K(X) =Y, 
(ii) d(X) =1+d(X’). 
Moreover, the code X' is derived from X. 


Proof. We first show that is , the kernel of some bifix code. For this, we verify 
the conditions of Theorem e st igt 2 inclusion Y C X’ shows that Y is not a 
maximal code. Next, by Proposition BHA, y) (Lx,y) for y € Y. Thus, setting 


d= d(X') +1, we have Acer )=d-1. 

According to Theorem 4.3, there is + thin maximal bifix code X having degree d 
such that K(X) = Y. By Theorem is code is unique. It remains to show that 
X’ is the derived code of X. Let Z be the derived code of X, BY Proposition BR? 
K(Z) c K(X) = Y Q Z. Thus we may apply Proposition Frit showing that for all 
we A*, 

(Lz,w) = min{d — 1, (Ly,w)}. 
The inclusions of formula fre zive, by Proposition Al 


(Lx:,w) = min{d — 1, (Ly, w)} 


for all w € A*. Thus Ly = Lz whence Z = X’. n 
Proposition ‘shows that the kernel of a code is located in some “interval” deter- 
rnined by the derived code. Theorem shows that all of the “points” of this interval 
can be used effectively. eee 
More precisely, Proposition aad Theorem show that there is a bijection be- 
tween the set of thin maximal bifix codes of degree d > 2, and the pairs 
(XY) 


.6 
composed of a thin maximal bifix code X’ of degree d— 1 and a set Y satisfying Eige 
The bijection associates to a code X the pair (X’, K(X)), where X’ is the derived code 
of X. 
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.3.3 ; 
EXAMPLE 4.1 We have seen in Example Bp ttiat any maximal bifix code of degree 2 
has the form 

X = C U BC*B, 


where the alphabet A is the disjoint unio B and C, and B Æ Ø. This observation 
can also be established by using Theorem ndeed, the derived code of a maximal 
bifix code of degree 2 has degree 1 and therefore is A. Then for each proper subset C 
of A there is a unique maximal bifix code of degree 2 whose kernel is C. This code is 
clearly the code given by the above formula. 


EXAMPLE 4.2 The number of maximal bifix codes of degree 3 over a finite alphabet A 
having at least two letters is infinite. Indeed, consider an infinite thin maximal bifix 
code X’ of degree 2 Jtg kernel K (X’) is a subset of A and consequently is finite. In 
view of Theorem each set K containing K(X’) and strictly contained in X’ is the 
kernel of some maximal bifix code of degree 3. Thus, there are infinitely many of them. 


5 Finite maximal bifix codes 


Finite maximal bifix codes have quite remarkable properties which make them fasci- 
nating objects. 


PROPOSITION 5.1 Let X C A? be a finite maximal bifix code of degree d. Then for 
each letter a € A, 
a eX. 


hapteri 
With the terminology introduced in Chapter [this is equivalent to saying that the 
order of each letter is the degree of the code. 
Proof. Let a € A. According to Proposition Et there is an integer n > 1 such that 
a” € X. Since X is finite, there is an integer k such that a* is not an internal factor 
of X. The number of parses of a” is equal to d. It is also the number of suffixes of a” 
which are proper prefixes of words in X, that is n. Thus n = d. n 


Note as a consequence of this result that it is, in general, impossible to complete 
a finite bifix code into a maximal bifix code which is finite. Consider, for example, 
A = {a,b} and X = {a?,b®}. A finite maximal bifix code containing X would have 
simultaneously degree 2 and degree 3. 

We now show the following result: 


THEOREM 5.2 Let A be a finite set, and let d > 1. There are only a finite number of 
finite maximal bifix codes over A with degree d. 


Proof. The only maximal bifix code over A, having degree 1 is the alphabet A. Arguing 
by induction on d, assume that there are only finitely many finite maximal bifix codes 
of degree d. Each finite maximal bifix code of degree d + 1 is determined by its kernel 
which is a subset of X’. Since X’ is a finite maximal bifix code of degree d there are 
only a finite number of kernels and we are finished. n 
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Denote by 6p(d) the number of finite maximal bifix codes of degree d over a k letter 
alphabet A. 

-2.3 

Clearly @,(1) = 1. Also 6%(2) = 1; indeed X = A? is, in view of Example E 

only finite maximal bifix code of degree 2. It is also clear that 3)(d) = 1 for all d > 1. 


EXAMPLE 5.1 Let us verify that 
fala) = 3. (4.5.1) 


Let indeed A = {a,b}, and let X C A* be a finite maximal bifix code of degree 3. 
The derived code X’ is necessarily X’ = A?, since it is the only finite maximal bifix 
code of degree 2. Let K = X ' be the kernel of X. Thus K c A?. 

According to Proposition Basen a®,b® € X. Thus K cannot contain a? or b?. 
Consequently, K C {ab, ba}. We next rule out the case K = {ab, ba}. Suppose indeed 
that this equality holds. For each k > 1, the word (ab)* has exactly two X parses. But 
X being finite, there is an integer k such that (ab)* € H(X), and (ab)* should have 
three X parses. This is the contradiction. 

Thus there remain three candidates for K: K = 0 w, Lich, correspond to X = A, 
then K = {ab}, which gives the code X of Example BY BS 5; = {ba} which gives 
the reverse X of the code X of Example Pot this shows - Note also that this 
explains why X is obtained from X by exchanging the letters a and b: this property 
holds whenever it holds for the kernel. 


We now show how to construct all finite maximal bifix codes by a sequence of internal 
transformations, starting with a uniform code. 


THEOREM 5.3 Let A be a finite alphabet and d > 1. For each finite maximal bifiz 
code X C At of degree d, there is a finite sequence of internal transformations which, 
starting with the uniform code A“, yields X. 


Proof. Let K be the kernel of X. If K = 0, then X = A? and there is nothing to prove. 
This holds also if Card(A) = 1. Thus we assume K 4 and Card(A) > 2. Let x € K 
be a word which is not a factor of another word in K. We show that there exist a 
maximal suffix code G and a maximal prefix code D such that 


GrDC X. (4.5.2) 


Assume the contrary. Let P = X A~. Since x € K, x is an internal factor. Thus the 
set Px! is not empty. Then for all words g € Px~', there exist two words d,d’ such 
that 

gzd, gxdď € X and X(zd) t Æ X(ad')"!. 


Suppose the contrary. Then for some g € Pa~!, all the sets X(ad)~', with d running 
over the words such that grd € X, are equal. Let D = {d | grd € X} and let 
G = X(ad)~!, where d is any element in D. Then GrD C X, contradicting our 
assumption. This shows the existence of d, d’. 

Among all triples (g, d,d’) such that 


gud, gzd' € X and X(zd)™t 4 X(ad')—, 
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let us choose one with |d| + |d'| minimal. For this fixed triple (g, d,d’), set 
G=X(ad)* and GQ = X (ad)! 


Then G and G’ are distinct maximal suffix codes. Take any word h € G — G’. Then 
either h is a proper right factor of a word in G’ or has a word in G’ as a proper suffix. 
Thus, interchanging if necessary G and G”, there exist words u, g’ € AT such that 


g €G, ug € C. 


Note that this implies 
g'zxdE X, ug ad’ € X. 


Now consider the word ug'xd. Of course, ug'xd ¢ X. Next ug'xd ¢ P, since otherwise 
g'xd € K, and z would be a factor of another word in K, contrary to the assumption. 
Since ug'xd ¢ PU X, it has a proper prefix in X. This prefix cannot be a prefix of ug'z, 
since ug’xd’ € X. Thus it has ug'x as a proper prefix. Thus there is a factorization 


d=d''v 


with d”,v € At, and ug’ad” € X. 

Now we observe that the triple (ug’,d’,d’) has the same properties as (g,d,d'). In- 
deed, both words ug'ad’ and ug'xd” are in X. Also X(ad')"! #4 X(axd")~" since 
gud’ € X, but ggd” ¢ X: this results from the fact that gard” is a proper prefix of 
gud € X (Fig. . Thus, (ug’,d’,d”) satisfies the same constraints, a: me d'): 
however, |d’| + |d”| < |d'| + |d|. This yields the contradiction and proves ‘ey Let 


Y = (X U Gr U zD) — (x U Gx D). (4.5.3) 


internal transformation with respect to x transforms Y into X. Finally shows 
that 


l N 2.4 . ; . 
In view of Proposition Bat the set Y is a finite maximal bifix code, and more ver, the 
E 


Card (Y) Card(X) + Card(G) + Card(D) — 1 — Card(G) Card(D) 


Card(X) — (Card(G) — 1)(Card(D) — 1). 


The code G being maximal suffix and Card(A) > 2, we have Card(G) > 2. For the 
same reason, Card(D) > 2. Thus 


Card(Y) < Card(X) — 1. (4.5.4) 


Arguing by induction on the number of elements, we can assume that Y is obtained 
from A? by a finite number of internal transformations. This completes the proof. n 


4 
Observe that by this theorem [and formula BE each finite maximal bifix code 
X C A* of degree d satisfies 


Card(X) > Card(Aĉî) (4.5.5) 
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Figure 4.10: From triple (g,d,d') to triple (ug’,d’,d"). 3_10 


with a lity iff X = Aĉ. This result can be proved directly as follows (see also 
Exercise EE 
Let X be a finite maximal prefix code, and 


A= So pkt 
LEX 


with k = Card(A). The number A is the average length of X with respect to the 
uniform Bernoulli distribution on A*. 


Let us show the inequality 
Card(X) > kò. (4.5.6) [eq3.5.6] 
F. imal bifi g X fd date A = d (Corollary BO) and thus G 
or a maximal bifix ¢ode Ao -~ have = d (Corollary , and thus 


is a consequence of To show , let n = Card(X). Then 


d= S$) kl logg alt, 


xEX 
log, n = > kll log, n. 
zEX 


The last equality follows from 1 = ` ex k—|*l, which holds by the fact that X is a 
finite maximal prefix code. Thus, 


A—log,n = X kl log, (kl /n). 
rExX 


The function log being concave, we have 


1 


xEX 


.6 
This shows (ay 


EXAMPLE 5.2 Let A = {a,b} and let X he the finite maximal bifix code of degree 4 
with literal representation given in Fig. | The kernel of X is K = {ab,a?b?} . There 


is no pair (G, D) composed of a maximal suffix code G and a. maximal prefix code D 
such that GabD C X. On the other hand 


Aa2b?A CX. 


The code X is obtained from the code Y given in Fig. ET by internal transforma- 
tion relatively to a?b?. The code Y is obtained from A* by the sequence of internal 
transformations relatively to the words aba, ab”, and ab. 
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Figure 4.11: The code X. 3_11 


Figure 4.12: The code Y. 3_12 


We now describe the construction of a finite maximal bifix code from its derived code. 

Let Y C At bea bifix code. A word w € A* is called full (with respect to Y) if there 
is an interpretation passing through any point of w. It is equivalent to saying that w 
is full if any parse of w is an interpretation. 

The bifix code Y is insufficient if the set of full words with respect to Y is finite. 


PROPOSITION 5.4 A thin maximal bifix code over a finite alphabet A is finite iff its 
kernel is insufficient. 


Proof. Suppose first that X is finite. Let d be its degree, and let K be its kernel. 
Consider a word w in H(X). Then w has exactly d X-interpretations. These are not 
all K-interpretations, because K is a subset of the derived code of X, which has degree 
d—1. Thus, there is a point of w through which no K-interpretation passes. Thus, w is 
not full (for K). This shows that the set of full words (with respect to K) is contained 
in H(X). Since H(X) is finite, the set K is insufficient. 
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Conversely, suppose that X is infinite. Since the alphabet A is finite, there is an 
infinite sequence (an)n>o of letters such that, setting P = XAT, we have for all n > 0. 


Dn = A901 * +: an E€ P 


Note that there are at most d(X) integers n for which p, is a proper suffix of a word 
in X. Similarly, there exist at most d(X) integers n such that for all m > 1, 


On4+14n4+2°** Anim E P. 


Indeed, each such integer n defines an interpretation of each word aoa1:--ar, (r >n), 
which is distinct from the interpretations associated to the other integers. 

These observations show that there exists an integer k such that for all n > k, the 
following hold: pp € A*X and ani1dni2°:'Gntm E X for some m > 1. The first 
property implies by induction that for all n > k, there is an integer i < k such that 
Qi: An E X*. 

Let w, = akak+1 `: ak4 for L > 1. We show that through each point of w; passes a 
K-interpretation. Indeed, let 


U = Akak41°°* an, U = Gn414n4+2°°* Akl 


for some k < n < k +1. There exists an integer i < k such that a;---a,_yu E€ X*, and 
there is an integer m > k+ I such that vak4ı am € X*. In fact, these two words are 
in H(X) A X* and consequently they are in K*. This shows that K is a sufficient set 
and completes the proof. a 


The previous proposition yields the following result. 


THEOREM 5.5 Let X’ be a finite marimal bifix code of degree d — 1 and with kernel 
K'. For each insufficient subset K of X' containing K', there exists a unique finite 
maximal bifix code X of degree d, having kernel K. The derived code of X is X’. 


Proof. Since K is in ient, K is not a maximal bifix code. Thus K’ cC K ¢ X. 
LE 


In view of Theorem K.O, there is a unique thin maximal, bi x gode X of degree d and 
kernel K. The derived code of X is X’. By Proposition 5.4, the code X is finite. n 


The following corollary gives a method for the construction of all finite maximal bifix 
codes by increasing degrees. 


COROLLARY 5.6 For any integer d > 2, the function 


is a bijection of the set of finite maximal bifix codes of degree d onto the set of all 
insufficient subsets K of finite maximal bifix codes X’ of degree d— 1 such that 


KiXVCK GX, 
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EXAMPLE 5.3 Let A = {a,b}. For each integer n > 0, there exists a unique finite 
maximal bifix code X, C At of degree n + 2 with kernel 


Ky = {a| 1 <i< nt}. 


For n = 0, we have Ko = Ø and Xp = A’. Arguing by induction, assume X, con- 
structed. Then Kn C Xp and also a”*?, b”*? € Xn, since d(X,) =n+2. We show that 
a”+lpr+! € Xp. Indeed, no proper prefix of a”+!b"+! is in X, since each has a suffix 
in X, or is a proper suffix of a”t?, Consider now a word a”*!p"+* for a large enough 
integer k. Since Xn is finite, there is some prefix a”t+!b"+" € Xp for some r > 1. If 
r > 2, then b"+? is a suffix of this word. Thus r = 1, and a” ttp +i € X. 

Clearly Kn C Ky41. The set Kn+1 is insufficient. In fact, a has no Kn+1 interpreta- 
tion passing through the point (a,1) and b has no interpretation passing through the 
point (1,6). Therefore, the set of full words is {1}. Finally 


Kn C Knit & Xn 


This proves the existence and unicity of Xn+1, by using Th ogepm 
The code Xj, is the cod of gegree 3 given in Example F a Xə is the code 
of degree 4 of Example ee 


We finish this section with some remarks on the length distribution of bifix codes. 
Contrary to the case of codes or of prefix codes, it is not true that any sequence (Un)n>1 
of integers such that }>,., Unk” < 1 is the length distribution of a bifix code on k 
letters. For instance, there is no bifix code on the alphabet {a,b} which has the same 
distribution as the prefix code {a, ba, bb}. Indeed, such a code must contain a letter, 
say a, and then the only possible word of length 2 is bb. We show that the following 
holds. 


PROPOSITION 5.7 For any sequence (Un)n>1 of integers such that 
spew all 
do tink" S 5 (4.5.7) 
n>1 
there exists a bifix code on an alphabet of k letters with length distribution (Un)n>1- 


Proof We show by induction on n > 1 that there exists a bifix code Xn of length 
distribution (u;)1<i<n on an alphabet A of k symbols. It is true for n = 1 since 


uik™t < K: aforan u, < k. Assume that the property is true for n. We have by 


Inequality ( 
n+1 


jol 
=] 
or equivalently, multiplying both sides by 2k"*1, 
2(u1k” +... + unk btn) < ket! 


whence 
Uns < 2Uns1 < kt) — 2(u1k” +... + unk). (4.5.8) 
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Since X,, is bifix by induction hypothesis, we have 
Card(X,,A* N A”*") = Card(A*X, NA") = urk” +... + Unk. 
Thus, we have 


Card((X,A* U A* Xn) N A") < Card(X,A*n A") + Card(A*X, N A"*) 


< uk” +...+ Unk) 


N . =iņtermediaire 
It follows with Equation (a) that 


Un+1 Š krt! = 2(u,k” Tesa F Unk) 
< Card(A”+!) = Card((Xn A* U A* Xn) NA?) 
= Card(A"+! — (X„A* U A*Xn)) 


This shows that we can choose a set Y of un+41 words of length n + 1 on the alphabet 
A which do not have a prefix or a suffix in Xn. Then Xn+1 = Y U Xn is bifix, which 
ends the proof. m 


Half 
The bound 1/2 in the statement of Proposition Eiis Probably not the best possible. 
It is conjectured that the statement holds with 3/4 instead of 1/2. For convenience, we 
call a sequence (un) of integers k-realizable if there is a bifix code on k symbols with 
this length distribution. 

We fix N > 1 and we order sequences (un)i<n<yN Of integeers by setting (un) < (Vn) 
if and only if un < Un for d FLAA E (un) < (vn) and (vn) is k-realizable then so is 
(un). We give in Table f i the values of the maximal 2-realizable sequences for N < 4. 
For each value of N, we list in decreasing lexicographic order the maximal realizable 
sequence with the corresponding value of the sum u(1/2) = X` un2 ”. The distributions 
with value 1 correspond to maximal bifix codes. For e pple, the distribution (0, 1, 4, 4) 
corresponds to the maximal bifix code of Example pao 

It can be checked on this table that the minimal value of the sums u(1/2) is 3/4. 
Since the distributions listed are maximal for componentwise order, this shows that for 
any sequence (tn)i<n<n Whith N < 4 such that u(1/2) < 3/4, there exists a binary 
bifix code X such that wx = u. ‘4 

Since a thin maximal bifix code is also maximal as a code (Proposition Sheneth maxpi 
distribution satisfies u(1/k) = 1, where k is the size of the alphabet. Table H a the 
length distributions of finite maximal bifix codes of degree d < 4 over {a,b}. For each 
degree, the last column contains the number of bifix codes with this distribution, with 
a total number of 73 of degree 4. There are 39 of them with {a,b}? as derivative and 
34 with one of the two other bifix codes of degree 3 (see the exercises). 


6 Completion 


For a finite bifix code X, a simple construction shows that it is contained in a maximal 
rational bifix code. Indeed, either X is already maximal, or it is, for each large ẹ oygh 
integer d the „Krnel of a maximal rational bifix code of degree d (Theorem gea 


Exercise F i : 
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[ORR a eae a E ERA 


ooocoocoocorerer+ere+}HrHRYPONMNNwWWwWHTTOOF 
GDOrFnNMNwWPR TDN WOOFNWHRTODORFNODFOOCOFN FH 


; 
1 
1 
1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


Table 4.1: The list of maximal 2-realizable length distributions of length at most N < 4. 


For a rational bifix code X which is not maximal, it is not true in general that it is 
the kernel of a maximal rational bifix code. Instead of acting from the outside, adding 
words having the words of X as factors, one has to work from the inside, adding first 
words which are factors of words of X (and therefore part of the kernel of the result). 


THEOREM 6.1 Each rational bifiz code is contained in a maximal rational bifix code. 
Let Y C A* be a bifix code. Recall that its indicator is the formal series defined by 
Ly = A*(1-Y)A*. 


We shall need several properties of the indicator, grouped in the following lemma for 
convenience. 


LEMMA 6.2 Let Y C A®* be a bifix code and L its indicator. For any words u,v,w and 
letter a, the following hold. 
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dji | 2. (30 


Aa 
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N O 


Co 


SOS SC -O SO OS SS OOO: OS: oS oS 
ee ce ce eG Gn a Gn Gn n> a O 
NOrrerFrRFOoOooocooooBWWWNNF OO 
wok BW ONNDMDMOWA AD OO 

= 

bo 
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co FP OF BRB BP 
NPR RR YOY KBKRNYNBKP KBR DWAAHDE 


CW nnNonwma® ow 0 


=. 
N 


KALRN = r 
0 1 4 4/2 


CHI HI BI e 


Table 4.2: The length distributions of binary finite maximal bifix codes of degree at 
most 4. 


(1) For eachi=1,...,(L,w), there is a prefix p of w such that (L, p) = i. 

(2) IfY is a rational set and is not a maximal code, then for any u, the set of values 
(L, uv) is unbounded for v € A*. 

(3) (L, w) = (L, wa) if and only if wa has a suffiz in Y. 

(4) If (L, v) = (L, uv), then uv has a prefiz in Y. 

(5) IfY cC Y’, then Ly > Ly». 


a 1.4 
Proof. Property (1) is an easy conseqyence of, Proposition fear (2), we note that 
a rational oe is A aa Ito. 144) if Y is rational and not maximal, L is 


onion core ence, (L, v) is arbitrarily large, and so is (L, wv) > (L,v) 


naan 

By the definition of L, (L, w) is equal to |w| + 1— the numbers of factors of w which 
are in Y. This number of factors is the same for wa, except if wa has a suffix in Y, in 
which case wa has exactly one more (since Y js, a suffix code). This implies (3). For 
(4), assume (L,v) = (L,uv). By Proposition fae have (L,v) = (L,u’v) for each 
suffix u’ of u; hence by the symmetric statement of (3), an easy induction on the length 
of u’, starting with |u’| = 1, shows that u’v has a prefix in Y. Thus wv has a prefix in 
Y. Property (5) is also a direct consequence of the definition of Ly. n 


The idea of the construction is the following. Starting with a rational bifix code 
X = Xo C A’, we build an increasing sequence of sets (Xn)n>1 which all are shown 
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to be rational bifix codes. It will then be proved that for some n, Xn is a maximal 
rational bifix code containing X, thereby proving the theorem. 

For any language Y, we denote by P(Y) the set of words of Y which are minimal for 
the prefix order. Thus, w € P(Y) if and only if w is in Y and has no proper prefix in 
Y. Moreover, [(Y) denotes the set of words in A* which are incomparable with Y for 
the prefix order. In other words, w € I(Y) if and only if w is not a prefix of a word 
in Y and has no prefix in Y. Finally, we denote by Y’ the set of P(I(Y)). It is called 
the companion of Y. The companion of a bifix code should not be confused with its 
derived code. 

Recall also that A~Y (resp. Y A`) denotes the set of proper suffixes (resp. prefixes) 
of words in Y. 

For the construction of the sequence Xn, we first add to the bifix code X the set 
of words of the companion X’ of X with minimal value of the indicator L which are 
not suffixes of X. We then iterate this procedure on the resulting code. The codes are 
indeed bifix, as shown in the following statement. 


PROPOSITION 6.3 Let X = Xo be a bifix code. Define recursively, for n > 0: 


Vn = {we A*| (Lx,,w)=n+]}, (4.6.1) 
Zn = I(Xn) AO P(Vn), (4.6.2) 
Xn = XnU(Zn\ AX). (4.6.3) 


For each n > 1, the set Xn is a bifix code and (Lx„,w) < n for all w € Xn \ X. 


Note that the union defining Xn+1 is disjoint, since Zn C I(X,) and I(X,,) cannot 
intersec Xr For sake of simplicity, we write Ln for Lx, In particular, Ln+1 > Ln by 
Lemma 5) because Xn is a subset of X41. 


Proof. Assume that X, is a bifix code and satisfies the jnequality in the statement. We 
show that the same hold for X,41. By Equation ees Zn is a prefx code which is 
incomparable with X,, for the prefix order. In view of Equation ( , Xn+1 of two 
prefix codes which are incomparable for the prefix order. Thus Xp, itself is a prefix 
code. 

It remains to show that Xn+1 is a suffix code. By contradiction, suppose that for 
some x, 27’ € Xn+1, v is a proper suffix of x’. By construction, we have two cases: either 
£ E Xn, or re Z,\AX. 

In the first case, we have x’ ¢ Xn, since Xn is a suffix code by induction. Thus 
x’ € Z,\ AX and wv’ € P(V,), hence v’ is in Vp, and by definition of the latter, 
(Ln, 2’) =n+1. Write 2’ = wa, a € A. Since x’ has a suffix in Xn (namely z itself), 
we have (Ln, w) = (Ln, wa) by Lemma (3). Thus (Ln,w) = n+ 1, which implies 
that w € Vn. This contradicts the fact that x’ € P(Vn). 

In the second case,  € Zn, hence x € Vp and (Ln, 2) = n+1. Moreover, 2’ ¢ X 
(otherwise x € A~X). Suppose that x’ € Xp, Then x’ € Xn \ X and by the induction 
hypothesis, (Ln, x’) < n. By Proposition Ei tis yields a contradiction, since x is a 
factor of x’. Thus we have 2’ € Zn \ ATX , This implies x’ € V, and consequently 
(Ln, x’) =n+1= (Ln, x). From Lemma 4), we deduce that x’ has a prefix in Xn, 
a contradiction, since x’ € Zn C I(Xn). We conclude that Xn+1 is a bifix code. 
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It remains to prove that (Ln41,2) < n+ for £ E€ Xn4i1\X. Let indeed x € Xn41\X. 
Since Xn C Xnii, we have by Lemma Bo. (Ln+1;£) < (Ln). fx € Xn, 
then (Ln,x) < n by the induction hypothesis; if £ Xn, then x € Zn C Vn, and 
(Ln,£) =n +1. In both case, we conclude that (Ln41,£) <n+1. n 


LEMMA 6.4 Let X = Xo be a rational bifix code. For each n > 1, the set Xn is a 


rational set. 


Proof. We prove the statement by induction on n. It is true for n = 0 by hypothesis. 
Suppose next that Xn is rational. Let Un = A*\ X,A*. This set is rational. According 
to , for any word z, (Ln, z) is the number of suffixes of z which are in Uy. 

Let A = (Q,i,T) be a deterministic automaton recognizing Un. Let B = (QUw,w, TU 
w) with w ¢ Q be the automaton obtained as follows. The edges are those of A plus a 
loop (w,a,w) for each letter a in A and an edge (w,a,q) for each edge (i,a,q) of A. 

Then, for any word z, the number of successful paths labeled by z starting in w is 


equal to the number i niana which are in U,,. In other words, (Ln, z) = (|B|, z). 


Thus, by Proposition , the set Vp is rational. Since [(X,) = A* \ XnA7 \XnA*, 
the set [(X,,) is rational. Since P(V,) = Vn — Vn A*, the set P(V,,) is also rational. 
Thus Z,, is rational and so is Xņn+1- m 


From now on, we assume that X = Xo s3 rational bifix code. In order to prove the 
theorem it is enough, in view o Lemma Hio show, that, Xn is a maximal bifix code 
for some n. By Theorem [6.9 and Proposition 1t is therefore enough to show 
that Xn is a right complete prefix code. This is the scope of the following lemmas. 

Given a partially ordered set S, the height of an element s of S, denoted h(s), is the 
maximal length of the strictly increasing chains ending in s. The height of S is maximal 
height of its elements, so it is simply the maximal length of a strictly increasing chain 
of elements in S. The height is finite or infinite. 

It follows from Proposition that for a rational prefix code Y, the height of the 
set of suffixes of Y for the prefix order is finite. A symmetric property holds for suffix 
codes; we denote by m the height of the set of prefixes of X for the suffix order. 

Recall that X’ = P(I(X)) denotes the companion of X. Thus, a word is in X’ if it 
is incomparable with the words of X for the prefix order and has no proper prefix with 
this property. 


le-3| LEMMA 6.5 The height of X' for the factor order is at most n. 


Proof. Assume, arguing by contradiction, that there is a strictly increasing chain for 
the factor order £o, £1, %2,..., £v Of length m + 1 with z; € X’. Since X’ is a prefix 
code, x; is not a prefix of zi+1. We may write zi = pisi, in such a way that each hain 
a proper suffix of pi+1, each s; is a nonempty proper prefix of s;}ı (see Figure eens 
Note that p; Æ pi41, since x; is not a prefix of x;,1. Hence po,...,pq_ is a strictly 
increasing chain for the suffix order. 
We prove that each p; is a prefix of X which yields a contradiction in view of the 
definition of 7. Indeed, each p; is a proper prefix of x; Since x; € P(I(X)), the word 
pi is not in I(X) and therefore p; is comparable with X. It cannot have a prefix in 
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To 
Ly 
T2 


Figure 4.13: A chain for the factor order 


X, since otherwise x; is comparable with X and this would imply that x; ¢ I(X), a 
contradiction. Thus p; is a prefix of X. 


Consider X’, the companion of X, ordered by the factor order. Define for i > 1 
X; = {w € X' | h(w) < i} 


where h(w) denotes the height of w in the partially ordered set X’. In particular, X{ 
is the set of words in X’ which are minimal for the factor order. The previous lemma 
shows that X/ = X’. 

Let o be equal to 1+ the height of set of suffixes of X for the prefix order. 


LEMMA 6.6 For any n > 0, the indicator Ln is bounded on the set of words such that 
all their proper suffixes are comparable for the prefix order with some word in Xn. 


Proof Let T be the set of words such that all their proper suffixes are ¢ parable for the 
prefix order with some word in X,,, and let w € T. Then by Lemma Rete, w) =1442, 
where £ is the number of proper suffixes of w which belong to A* \ X,,A*; now, since 
none of them is in [(X,,), they all belong to X„ A”. 

Therefore / is bounded by the maximal length of increasing chains of prefixes of 
X, for the piga This number is bounded, by the symmetric statement of 


Proposition | since X,, is rational. m 


LEMMA 6.7 There exists m such that Lm is bounded on the companion X' of X. 


Proof. We prove by induction on i > 1 that there exists k such that Lẹ is bounded on 
a 

For i = 1,,we prove that Lo is bounded on Xj. For this, we show that we may apply 
Lemma with n = 0 and T = X}. Indeed, assume on the contrary that some v € X} 
has a proper suffix s which is in [(X). Then some prefix of s is in P(I(X)) = X’, and 
v has a proper factor in X’, which contradicts the definition of X}. 

Suppose now that i > 1. By the induction hypothesis there are integers m and £ such 
that Lm(w) < £ for all w € X/_,. We may suppose that m < £. Let k = £+ o where o 

as, defined above. Since m < +0, we have Xm C Xe+o and Lm > Lyi, by Lemma 

). Thus Ly is bounded on Xj_,. It remains to show that Lẹ is bounded on X%. 

Let w € X; \ X{_,. We show that any proper suffix u of w is comparable with X} 

for the prefix order. 
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Indeed, if u is comparable with X for the prefix order, then it is comparable with X% 
(since X C X;); if on the other hand, u € I(X), then u has a prefix v in X’. Then v 
is a proper factor of w, pengo ve Xi_,andue far is comparable with X&+ for the 
prefix order by Lemma elow. Thus Lemma applies (with T = X} and n = k), 
and we deduce that Lp is bounded on X;. 


LEMMA 6.8 Let T C X’ and m,£ be two integers such that m < £. If (Lm, w) < £ for 
any w € T, then every word in TA* is comparable for the prefix order with a word in 
Xeo- 


Proof Define W; = P(Ve4;) NT A*, i > 0. The main argument consists in showing that 
each v € Ws has a prefix in X¢4,. 

For this, take a word v € W,. Since v € Veig, we have Mera BEY +oa+1. Let 
i € {0,...,o7}. Then Xei C Xeo and thus we have by Lemma (5) (Lepi v) = 
(Leyo,v) =l+o+) > €+i+1. 

Thus by Lemma Efo for some prefix p; of v, one has (Lepi, pi) = 2+i+1 and 
therefore p; € Vipi. We may even assume that p; € P(Vo4;). 

Now, v = tu with ¢ € T and u € A*. We have +i+1 > ¢ 2 (Lm,t) (by the 
hypothesis in the Lemma) > (Lepi, t) (by Lemma ZIO singe Xm C Xeri). Since 
(Lipi pi) =L+i+1, t must be a prefix of p; by Proposition Et thus pi E€ TA* and 
therefore p; € W;. Suppose by contradiction that v € I(Xe+o). Let i € {0,...,0— 1}. 


<£ L+0o+1 
Figure 4.14: The respective positions of p;, v and t 


We show first that p; € I(Xe+;i). Indeed, p; cannot have a prefix in Xg+;, since this 
word would be prefix of v, hence comparable with v, and we have assumed that v is 
not comparable with X¢,,, which contains X¢,;; suppose that p; is a prefix of some 
x € Xei; since p; € TA*, the word p; has a prefix t in T, and t is also a prefix of x. Since 
T C X' by assumption, the word t is incomparable with any word of X. There Ore g js 
not in X. Thus by Lemma 6.3, (Leyi, £) < €+7%, which implies by Proposition i A that 
(Lesi, pi) < (Lepi; £) < €+%. But we know that p; € Wi C Vipi > (Lepi, pi) = L+i+1, 
a contradiction. 

Since p; is in [(Xp4;), it is in Zepi. Now, pi Z Xe4i41 (otherwise v has a prefix in 
Xe+i+1 C Xe+o, which contradicts our assumption above v € I(Xe+0)). Thus we must 
have p; € AT X, since Zepi \ ATX C Xepipı. aN pi is a proper 2 of pi+1 : indeed, 
if on the contrary p;4, is a prefix of pi, eee +i+1= +i» Pi) > (Lipi, Pi+1) 2 
(Lesin1, pi+1) = L+i +2 (by Proposition F ani Lemma Bas) „a contradiction. 

Thus we obtain a chain of o suffixes of X, a contradiction Ta thie definition of ø. 

We conclude that v € I(X¢4,) : for some x € Xeo, v and x are comparable. If v is a 
prefix of x, then x ¢ X (otherwise, prefix t of v which is in T is a word in X’, comparable 


Version 22 janvier 2008 J. Berstel, D. Perrin and C. Reutenauer 


6. Completion 227 


a contradiction by Proposition s x is a prefix of v. Thus we have shown that 
each word in W, has a prefix in X¢4,. 

Let ngw_q = tu be any word in T'A* with t € T. We have cpa (Lm,t) (by 
Lemma E5) <é<+o+1. Thus, by Proposition [Patan Lemma 2), assuming 
that X¢4, is not maximal (otherwise there is nothing to prove), there is some word u’, 
comparable with u for the prefix order, such that v = tu’ € W,. By what we have 
already shown, v has a prefix in X¢;, and we conclude that w is comparable with a 
word in Xpi,. m 


=3 
with X); hence by Lemma ba L ty) <fl+o. Now, (Leo, v) = €+0 +1, which is 
4 Thu 


=3 = 

Proof of the theorem. By Lemma bofand ET Le is bounded on X’ for some p-p hus 
we may find £ such that k < £ and (Ly,w) < £ for any w in X’. Lemma fast ow 
implies that every word in X’A* is comparable for the prefix order with a word in 
Xio. Let w € A*. If w is not comparable with a word in X, then it is in X’A*, and 
therefore is comparable with a word in Xo. Thus any word in A* is comparable for 
the prefix order, with some word in X¢4¢. his, shows that Xi, is a maximal bifix 
code containing X. It is rational by Lemma Er Hence the theorem is proved. n 


representing X, viewed as prefix code, is in Figure on the left where the values of 
the indicator on the prefixes are indicated. 


We give now an example which may be n Figure BE fet X = Xo = ba*bb. The tree 


3 
Figure 4.15: The prefix codes X = ba*bb and X’ = aU ba*ba 


It follows that I(X) = aA* Ub?aA* U babaA* U baba A’... 
= aA* Uba*baA*. 2 
b 
Thus X’ = aUba*ba. The prefix code X’ is indicated in Figure kreon the right with 


the values of Lo on its prefixes. It is easy to see that, by definition of Lo, (Lo, a) = 2 pes Pa 


(Lo, ba"ba) = n + 4, since a and ba”ba have no factor in X. Hence, by Proposition [L4 
(Lo, w) > 2 for any w in I(X) = (aUba*ba)A* and we deduce that Zo = Ø. Thus X = X; 
and I(X) = I(X1). We have now that the only possible word in Z; = I(X1) A she 
a; thus Z, = {a} and Xə = X, U {a} = aU ba*bb, since a ¢ ATX (see Figure 3 

Now, I(X2) = ba*baA*. We have Lo, ba"ba) = n + 4 — (n + 1) = 3, since the only 
factor of ba”ba in Xə is a, with multiplicity n + 1. Moreover (L2,ba"b) = 3, hence 
ba"ba ¢ P(V2) and likewise, no w in I(X2) is in P(V2). This implies that Zə = 0 
and X3 = Xə. We now have Z3 = P(V3) N I(X3) = ba*batb. Indeed for n,m > 0 
L3(ba"ba™) = 3 and L3(ba"ba'™b) = 4. Thus X4 = aU ba*bb U ba*batb = aU ba*ba*b. 
It is easily checked that I(X4) = Ø and thus X4 is right complete, hence maximal. 
Observe that X’ = a U ba*ba, also F; = {a} and Fə = ba*ba. 
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3 3 


Figure 4.16: The bifix codes Xə = aU ba*bb and X4 = a U ba*ba*b 


7 Exercises 


g section3.1 
Section 


1.1 Let X C At be a bifix code and L = Ly its indicator. Show that if for u,v € A* 
we have (L, uvu) = (L, u), then for all m > 0, (L, (uv)™u) = (L, u). 


. section3.2 
Section 


2.1 Let X C At bea thin maximal prefix code. To each word w =€ aiaz- -an F(X) 
with a; € A, we will associate a function pw from {1,2,...,n} into itself. 


(a) Show that for each integer i in {1,2,...,n}, there exists a unique integer k € 
{1,2,...,n} such that either ajaj41--- ap Or GjQ;41 +++ Gna +++ az is in X. Set 


Pw(t) =k. 


This defines, for each w € F(X), a mapping pw from {1,2,...,|w|} into itself 
(b) Show that X is suffix iff the function pw is injective for all w € F(X) 
(c) Show that X is left complete iff the function pw is a surjection for all w € F(X). 
(d) Derive from this that a thin ximal prefix code is suffix iff it is left complete 
(cf. the proof of Proposition Bie 


2.2 Let P = {ww | w € A*} be the set of palindrome words of even length. 


(a) Show that P* is biunitary. Let X be the bifix code for which X* = P*. Then 
X is called the set of palindrome primes. 
(b) Show that X is left complete and right complete. 


2.3 Show that two maximal bifix codes which are obtained one from the other by 
internal transformation are either both recognizable or both not recognizable. 


2.4 Show that a maximal bifix code X C A? is a group code if and only if for any 
u,v,w,r E€ A*, 
uv, uw,rv E€ X* > rw € X* (4.7.1) 
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i section3.3 
Section 


3.1 Let P, S € Z(A) be polynomials such that 
P(1—A) =(1—A)S. 
Show that there exist an integer d € Z and a polynomial Q € Z(A) such that 
P=(1-A)Q+d; S=Q(1-A)+d 


(Hint: show for this that if two homogeneous polynomials P,Q € Z(A) satisfy PA = 
AQ, then P = AR,Q = RA for some polynomial R.) 


exo3.3.2| 3.2 Let X be a thin maximal bifix code of degree d. Let w € H(X) and let 


1 = pi, p2,.--,Pa 
be those of the suffixes of w which are proper prefixes of X. Set 
Y; = p X fr 2<i<d, Y,=1. 


Show that each Y; is a maximal prefix set, an that the set S' of proper suffixes of X is 
the disjoint union of the Y;’s (cf. Theorem B.8)- 


3.3 Let X be a thin maximal bifix code of degree d and let S be the set of its proper 
suffixes. Show that there exists a unique partition of S into a disjoint union of d prefix 
sets Y; satisfying 

Y;-1 C YA for 2<i<d. 


(Hint: Set Yg = S N H(X).) 


. section3.4 
Section 


4:3 
ex03.4.1| 4.1 Let X be a finite bifix code. Show, using Theorem EF tiat there exists a recog- 
nizable maximal bifix code containing X. 


ex03.4.2| 4.2 Show that if X is a recognizable maximal bifix code of degree d > 2, then the 
derived code is recognizable. 


exo3.4.3| 4.3 Let X be a thin maximal bifix code of degree d > 2. Let w € H(X), and let s be 
the longest prefix of w which is a proper suffix of X. Further, let x be the prefix of w 
which is in X. Show that the shorter one of s and z is in the derived code X’. 


exo3.4.4| 4.4 Let Xı and Xə be two thin maximal bifix codes having same kernel: K(X1) = 
K(Xə). Set 
X =X AX 


, 4.2 es , l : f ; 

(cf. Exercise EI sfiow that X is thin, maximal and bifix. Use this to give a direct 
proof of the fact that two thin maximal bifix codes with same kernel and same degree 
are equal. 
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i section3.5 
Section 


5.1 Let X bea finite maximal bifix code. Show that if a word w € A? satisfies 


pwq = rws € X (4.7.2) 


for some p,q,r,s € At, and p Æ r, then w € p when X’ is the derive gade of X. 
(Hint: Start with a longest word satisfying ( and use Proposition B-A) 
3.5.1 


5.2 For a finite code X, let I(X) = max{|z| | £ € X}. Show, using Exercise [I], tha 
if X is a finite maximal bifix code over a k letter alphabet, then 


UX) < U(X’) + k- 


with X’ denoting the derived code of X. 
Denote by A(k, d) the maximum of the lengths of the words of all finite maximal bifix 
codes of degree d over a k letter alphabet. Show that for d > 2 


Mk, d) < A(k,d — 1) + H4-Y-1 
-5.2 
Compare with the bound given by Theorem Ba 


5.3 Let X C At be a finite maximal bifix code of degree d. Let a,b € A, and define 
a function y from {0,1,...,d — 1} into itself by 


were ex 
Show that y is a bijection. 


5.4 Show that for each k > 2, the number (;,(d) of finite maximal bifix codes of degree 
d over a k letter alphabet is unbounded as a function of d. 


5.5 A quasipower of order n is defined by induction as follows: a quasipower of order 
0 is an unbordered word. A quasipower of order n+ 1 is a word of the form uvu, where 
u is a quasipower of order n. Let k be an integer and let a, be the sequence inductively 
defined by 

a, =k+1, 


Ont1 = An(k%" +1) (n> 1) 


Show that any word over a k letter alphabet with length at least equal to a, has a 
factor which is a quasipower of order n. 


5.6 Let X be a finite maximal bifix code of degree d > 2 over a k letter alphabet. 
Show that 
< Qad- 2 
max |a| < agi + 


3.5.5 A1 
where (an) is the sequence defined in Ege, cise Pa tt: Use Exercise i i Compare 


with the bound given by Exercise 
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5.7 Show that the number of finite maximal bifix codes of degree 4 over a two-letter 
alphabet is 
(2(4) = 73. 


5.8 Let X be a thin maximal biprefix code of degree d on k letters. Let S be the set 
of its suffixes and let (U;)1<i<q be disjoint maximal prefix codes such that S' is their 
union. Let R; be the set of prefixes of U;. Define t(z) = 8 ur, (z). Show that 


ux(z) —1= (kz —1)d + (kz — 1)*t(z). 


5.9 Let X be a thin maximal Diprenx code on k letters of degree d. Let u = (Un)n>1 
be its length distribution. We have ¢u'(1/k) = d, where the last expression can be 
viewed as the average length of the eae of X with respect to the uniform Bernoulli 
distribution. Show that the variance of the lengths is given by 


vx = 2t(1/k) +d- @? 


. g . blemTower . . 
where t(z) is defined in Exercise Eg Recall that the variance is the mean of the squares 
of the length minus the square of the mean of the lengths. 


. section3.5bis 
Section 


6.1 Show that if X is a prefix code, then Y = X U X’ is a maximal prefix code. Show 
that if X is rational, so is Y. 


8 Notes 
The idea to study bifix codes goes back to (L956) and 


l d (959). These papers already contain significant results. The first systematic 


study is in Schtitzenbergey (961b), Bchützenberger]| (L961a) 
Propositions Pana p4 Eion from Schützenberger ((1961a Kani The internal transforma- 


tion appears in (L961a)). The fact that all finite maximal bifix ç des, 


can be obtained from the ee codes by internal transformation (Theorem is 
from (L982). The Gt that the average length of a thin maximal bifix code 
is an integer (Corollary BS ead in (Gilbert and Moord and Moore ( E Ihi is proy. q ip 
(961b) wit the, methods ‘vk Cp in ae eorem an 
its converse eee appear in Perrin] ((L977a)). The notion of derived code 


is due to (1979). The presentation of the results given here, using the indicator 
and the tower, appears 


ee ae «(ties 
The results of Section are a generalization to thin codes of results in (L979). 
The methods of this section are used in ({L982)) to prove that any finite bifix code 


is contained i 4 recognizable maximal bifix code (Exercise : 

Trg pens already in ({L961b) with a different proof (see 
Exercise ~The rest of this section is due to (1979). The enumeration of finite 
maximal bifix codes over a two-letter alphabet has been pursued by computer. A first 
program was written in 1975 by C. Precetti using internal transformations. It produced 
several thousands of them for d = 5. In 1984, a program written by M. Léonard using 
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the method of Corollary Soave the exact number of finite maximal bifix codes of 
degree 5 over a two-letter alphabet. This number is 5,056 783. 
Bifix codes and their length distributions of have been studied with a practical mo- 
tivation, a the name of — variable-length codes (see Y. Takishima andi ia 
(i995); [G 1] (095); Ye and Young] $001). Proposition P7 
is from 1] (1996) It has been conjectured by Ahlswede and Kachatrian 
that for any sequence u = (un) of integers satisfying u(1/2) < 3/4 there exists a binary 
bifix code X such that ux = u. We actually conjecture that this is true for any number 
of letters, namely that for any sequence u = (un) of integers satisfying u(1/k) < 3/4 


there exists a biprefix c code X on k letters such that ux = u. 
Theorem [ő 6-1 is due to Z hang and Sher] (L995). For the proof of the theorem, we have 
followed |B (L999). . E 
Exercise P agi appears in [Long] (L996). Exercise B i is a very special case 


Cohn. called “weak Euclidian algorithm” in (1985). Exercises B 5-3 
Pa are from ((L979)). 
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Circular Codes 


0 Introduction 


In this chapter we study a particular family of codes called circular codes. The main 
feature of these codes is that they define a unique factorization of words written on a 
circle. The family of circular codes has numerous interesting properties. They appear 
in many problems of combinatorics on words, several of which will be mentioned here. 

In Section [we give the definition of circular codes and we characterize the submonoid 
generated by a circular code. We also describe some elementary properti sof circular 
codes. In particular we characterize maximal circular codes (Theorem ; 

In Section P| we introduce successive refinements of the notion of a circular code. 
For this we define two notions, namely (p,q)-limitedness and synchronization delay. 
We study the relations between these parameters. For finite codes we show that, these 
notions allow us to classify completely the family of circular codes (Theorem j. We 
then proceed to a more detailed study of (1, 0)-limited code Lhes godes are exactly 
the left factors f, bisections studied in Sections REP reese r we show 
(Proposition partie (1,0)-limited codes correspond to ordered automata. Comma- 
free codes are defined as circular codes satisfying the strongest possible condition. 

ction/. 

Section Be concerned with length distributions of circular codes. Two important 
theorems are proved. The first gives a characterization of sequences of integers which 
are the length distribution of a circular code (Theorem j. The second shows that 
for each odd integer n there exists a system of representatives of conjugacy classes of 

rj ' itive words of length n which not only is circular but even comma-free (Theorem 

. The proofs of these results use similar combinatorial constructions. As a matter 
of fact they are based on the notion of factorization of free monoids studied in the next 
chapter. 


1 Circular codes 
We define in this section a new family of codes which take into account, in a natural 
way, the operation of conjugacy. 


By definition, a subset X of AT is a circular code if for all n,m > 1 and 271, £2,..., 2n € 
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X, Y1,Y2;,.--,Ym E X and p € A* and s € A’, the equalities 


ST2T3:':LnP = Y1Y2:*: Ym, (5.1.1) 
n= p 612) 


imply 


n= m; p=1 and zi= yi (<i<n) 


, 1 
(see Fig. Ba. . . . As 
A circular code is clearly a code. The converse is false, as shown in Example i F 
The asymmetry in the definition is,only apparent, and comes from the choice of the 


cutting point on the circle in Fig. | Clearly, any subset of a circular code is also a 
circular code. 


Figure 5.1: Two circular factorizations. 7_01 


Note that a circular code X cannot contain two distinct conjugate words. Indeed, if 
ps, sp E€ X with s,p € At then 


s(ps)p = (sp)(sp). 
Since X is circular, this implies p = 1 which yields a contradiction. Moreover, all words 
in X are primitive, since assuming u” € X with n > 2 , it follows that 

u(u” ju”! = u”u”. 


This implies u = 1 and gives again a contradiction. 

We shall now characterize in various ways the submonoids generated by circular 
codes. The first characterization facilitates the manipulation of circular codes. 

A submonoid M of A* is called pure if for all x € A* and n > 1, 


t EMs>rEM (5.1.3) 


or equivalently 
rEMs>V/reM. 


Version 22 janvier 2008 J. Berstel, D. Perrin and C. Reutenauer 


1. Circular codes 235 


A submonoid M of A* is very pure if for all u,v € A*, 


uv,vue M => u,v E€ M. (5.1.4) 


ks .1.3 
A very pure monoid is pure. The converse does not hold (see Example ES 


PROPOSITION 1.1 A submonoid M of A* is very pure iff its minimal set of generators 


is a circular code. 


Proof. Let M be a very pure submonoid. We show that M is stable. Let m, m’, £m, m'x € 
M. Then setting u = z,v = mm’, we have uv, vu € M. This implies x € M, g 
s,M is stable, hence M is free. Let X be its base. Assume that Bi and 
hold. Set u = s,v = x9x%3---a%np. Then uv,vu € M. Consequently s € M. 
Since ps, £t2£3'*- Enp E M, the stability of M implies that p € M. From ps € X, it 
follows that p = 1. Since X is a code, this implies n = m and z; = y; for i = 1,...,n. 
Conversely, let X be a circular code and M = X*. To show that M is very pure, 
consider u,v € A* such that uv, vu € M. Set 


UU = T172 :': Tn, vu = Y1Y2:':: Ym, 


with x;,y; E€ X. There exists an integer 7, 1 < i < n such that 


U = T1T2''' Ti—1P, V = SXi41°°° Tn 


with x; = ps,p € A*,s € At. Then vu may be written in two ways: 


SUj41 `’ Un %Q°++* Li—1P = Y1Y2°°* Ym. 


Since X is a circular code, this implies p = 1 and s = yı . Thus u,v € M, showing 
that M is very pure. n 


ex7.1.1| EXAMPLE 1.1 Let A = {a,b} and X = a*b. Then X* = A*bU 1. Thus if uv, vu € X*, 
the words u,v either are the empty word or terminate by a letter b; hence u,v € X*. 
Consequently X* is very pure and X is circular. 


ex7.1.2| EXAMPLE 1.2 Let A = {a} and X = {a°}. The submonoid X* clearly is not pure. 
Thus X is not a circular code. 


ex7.1.3| EXAMPLE 1.3 Let A= fab} and X = {ab, ba}. The code X is not circular. However, 
X* is pure (Exercise a 


The following proposition characterizes the flower automaton of a circular code. 


st7.1.2| PROPOSITION 1.2 Let X C A% be a code and let y be the representation associated 
with the flower automaton of X. The following conditions are equivalent : 


(i) X is a circular code. 
(ii) For all w € At, the relation p(w) has at most one fixed point. 
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Proof. For convenience, let 1 denote the state (1,1) of the flower automaton A(X). 

(i) => (ii) Let w € At, and let p = (u,v), p’ = (u’,v’) be two states of A} (X) which 
are fixed points, i.e., such that fy p) = Ww, p(w), p’) = 1. 

Since w Æ 1, Proposition VIN. shows that w € vX*u and w € v'X*u’.Thus both 
paths c : p —> p and d : p' —> p' pass through the state 1. 

We may assume that v < v’. Let z,t € A* be the words such that v’ = vz and 
w = vzt. Then the paths c,c' factorize as 


cC: p 21r > P, Cip >s— 1 tp! 


Thus there are also paths 


d:1 5r — p51, 1— p >s 1 


showing that ztv, tuz € X*. Since X* is very pure, it follows that z,tv € X*. Conse- 
quently, there is a path e : 1 —> 1 Hj, By unambiguity, d = e, whence r = 1. Thus 
1—5 p —> 1 which compared to d’ gives p = p'. This proves that (w) has at most 
one fixed point. 

(ii) => (i) Let u,v € A* be such that uv,vu € X*. Then there are two paths 
1 = p = 1 and 1 = q > 1. Thus the relation y(wv) has two fixed points, namely 
1 and q. This implies q = 1, and thus u,v € X*. n 


COROLLARY 1.3 A thin circular code is synchronous. 


Proof. Let p be the representation associated with the flower automaton A7,(X) of X. 
sS (AT) be an idempotent with positive minimal rank. According to Proposition 

eS the rank of e is 1. Thus d(X) = 1. 

We now give a characterization of circular codes in terms of conjugacy. For this, the 
following terminology is used: 

Let X C At be a code. Two words w,w’ € X* are called X-conjugate if there exist 
x,y E€ X* such that 

w= zty, w =y. 


The word « € X* is called X-primitive if x = y” with y € X* implies n = 1. The 
X-exponent of x € X* is the unique integer p > 1 such that x = y? with y an X- 
primitive word. Let a: B — A* be a coding morphism for X. It is easily seen that 
w,w’ € X* are X-conjugate iff a~!(w) and a7!(w’) are conjugate in B*. Likewise, 
x € X* is X-primitive iff a7! (x) is a primitive word of B*. 

Thus, X-conjugacy is an equivalence relation on X*. Of course, two words in X* 
which are X-conjugate are conjugate. Likewise, a word in X* which is primitive is also 
X-primitive. When X = A, we get the usual notions of conjugacy and primitivity. 


PROPOSITION 1.4 Let X C A? be a code. The following conditions are equivalent. 


(i) X is a circular code; 
(ii) X* is pure, and any two words in X* which are conjugate are also X- 
conjugate. 
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Proof. 

(i) = (ii) Since X* is very pure, it is pure. Next, ] t w, w’ € X* be conjugate words. 
Then w = uv, w’ = vu for some u,v € A*. By E € X*, showing that w and 
w’ are X-conjugate. 

(ii) => (i) Let u,v E€ A* be such that uv, vu € X*. If u=1 or v = 1, then u,v E€ X*. 
Otherwise, let x,y be the primitive words which are the roots of uv and vu: then 
uv = x”, ,vu = y” for some n > 1. Since X* is pure, we have x,y € X*. Next 
uv = x” gives a decomposition x = rs,u = x?r,v = sx? for some r € A*,s € At and 
p+q+1 =n. Reporting this in the equation vu = y”, yields y = sr. Since x,y are 
conjugate, they are X-conjugate. But for primitive words x,y, there exists a unique 
pair (r,s’) € A* x A*t such that x = r's’,y = s'r’. Consequently r,s € X*. Thus 
u,v E€ X*, showing that X* is very pure. n 


PROPOSITION 1.5 Let X C A* be a code and let C C A” be a conjugacy class that 
meets X*. Then 


1 1 
X` —Card(X™ N C) > —Card(C). (5.1.5) 
n n 
Further, the equality holds iff the following two conditions are satisfied: 
(i) The exponent of the words in C N X* is equal to their X -ezponent. 
(ii) CO X* is a class of X-conjugacy. 
Proof. Let p be the exponent of the words in C. Then Card(C) = n/p. The set CN X* 
is a union of X-conjugacy classes. Let D be such a class, and set C’ = C — D. The 


words in D all belong to X* for the same k, and all have the s me X-exponent, say q. 
Then Card(D) = k/q. Since C = C’U D, the left side of EAS 


n n 
1 m 1 1 m 
5 = Card(X AC) + `> z Card(X ND). 
m=1 m=1 
In the second sum, all terms vanish except for m = k. Thus this sum equals 
(1/k)Card(X* N D) = 1/q. Thus 


> ~Card(X” NC) = D —Card(X™ N C’). (5.1.6) 


m=1 m=1 


Since q < p, we have 1/q > 1/p = (1/n) Card(C). This proves formula (Bea 


Assume now that (i) and, (ij) hold. Then p = q, and D = CN X*. Th C'N 
Thus the right side of EFS equal to 1/ hi ch shows that equali holds i in ( 


Ey 


P 
Conversely, assuming the equality sign in jean follows from oat 
1 1 Ql 1_1 
==- + 5 —Card( X” NC") > =- > =, 
p q “<~m q p 
which implies p = q and C’ N X* = 9. m 


The proposition has the following consequence: 


PROPOSITION 1.6 Let X C At be a code. The following conditions are equivalent: 
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(i) X is a circular code. 
(ii) For any integer n > 1 and for any conjugacy class C C A” that meets X*, 
we have i i 
X —Card(X™ N C) = —Card(C). (5.1.7) 
m n 
Proof. By Proposition feat tie code X is circular iff we have 
(iii) X* is pure. 
(iv) Two conjugate words in X* are X-conjugate. 
Condition (iii) is equivalent to: the X-exponent of any word in X* is equal to its 
exponent. Thus X is circular iff for any conjugacy class C meeting X*, we have 
(v) The exponent of words in C N X* equals their X-exponent. 
(vi) C N X* is a class of x conjugacy. 
In view of Proposition [[-df co jtions (v) and (vi) are satisfied iff the conjugacy class 
C N A” satisfies the equality oman proves the ee n 


We now prove a result which is an analogue of Theorem 6-1. 


PROPOSITION 1.7 Let X C At be a circular code. If X is maximal as a circular code, 
then X is complete. 


Proof. If A = {a}, then X = {a}. Therefore, we assume Card(A) > 2. Suppose that X 


is not complete. en here is a word, say w, which is not a factor of a word in X*. 
By Proposition 08.0, there is a word v € A* such that y = wv is unbordered. 


Set Y = X Uy. We prove that Y is a circular code. For this, let x; (1 < i < n) and 
yi (1 < i < m) be words in Y, let p € A*,s € A* such that 


SXLQL3°°* InP = Y1Y2' ` Ym 
zı = ps 


If all z; (1 < i < n) are in X, then also all y; are in X, because y is not a factor of a 
word in X*. Since X is circular, this then implies that 


n=m, p=l1, Ti =t <i<n). (5.1.8) 


Suppose now that x; = y for some i € {1,...,n} and suppose first that i 4 1. Then z; 
is a factor of y1Y2:-- Ym. Since y ¢ F(X*), and since y is unbordered, this implies that 
there is a j € {1,2,...,m} such that y; = y, and 


ST2' +: Li-1 = Y1Y2'*' Yj—1, Yi+1 °° TnP = Yj+1 t Ym. 
This in turn implies 
$LQ° °° Li—1i+1''' TnP = Y1Y2:'' Yj—1Yj+1 Ym, 


-8 
and (Ataf follows by induction on the length of the words. Consider finally the case 
where i = 1, i.e., zı = y. Since 


TIT Enp = PY1Y2°°* Ym, 
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we have y£2 -np = Py1Yy2°-: Ym. Now p is a suffix of a word in Y*; ugther y ¢ F(X*) 
and y is unbordered. Thus p = 1 and yı = y. This again gives y induction on 


the length of the words. Thus if X is not complete, then Y = X Uy is a circular code. 
Since y ¢ X, X is not maximal as a circular code. n 


I 
The preceding proposition and Theorem Hea imply 


THEOREM 1.8 Let X be a thin circular code. The three following conditions are equiv- 
alent. 

(i) X is complete. 

(ii) X is a maximal code. 

(iii) X is maximal as a circular code. 


Observe that a maximal circular code X C A* is necessarily infinite, except when 

Tl 4 Indeed, assume that X is a finite maximal circular cede. Then by Theorem 

it is a maximal code. According to Proposition PETE er is, for each letter 

a € A, an integer n > 1 such that a” € X. Since X is circular, we must have n = 1, 

and consequently a € X for alla € A. Thus X = A. We shall need the following 
property which allows us to construct circular codes. 


PROPOSITION 1.9 Let Y,Z be two composable codes, and let X =Y o Z. IfY and Z 
are circular, then X is circular. 


Proof. Let a: B* — A* be a morphism such that X = Y og Z Let u,v € A* be such 
that uwv,vu € X*. Then uv,vu € Z*, whence u,v € Z* because Z* is very pure. Let 
s=a'(u),t =a !(v). Then st,ts € Y*. Since Y* is very pure, s,t € Y*, showing 
that u,v € X*. Thus X* is very pure. n 


2 Limited codes 


We introduce special families of circular codes which are defined by increasingly restric- 
tive conditions concerning overlapping between words. The most special family is that 
of comma-free codes which is the object of an important theorem proved in the next 
section. 

Let p,q > 0 be two integers. A submonoid M of A* is said to satisfy condition C (p, q) 
if for any sequence 


U0, U1,+++,Upt¢ 
of words in A*, the assumptions 
uju; © M (1 <i<pt+ q) (5.2.1) 
imply 
Up E M 


(see Fig. Baa. 
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For example, the condition C(1,0) simply gives 
weEeM>veMmM 
i.e., M is suffix-closed, and condition C(1,1) is 
uv,yweE M >veEe M. 


It is easily verified that a submonoid M satisfying C(p,q) also satisfies conditions 
C(p',q’) for p > p,q’ >q. 


PROPOSITION 2.1 Let p,q > 0 and let M be a submonoid of A*. If M satisfies condi- 
tion C(p,q), then M is very pure. 


Proof. Let u,v € A* be such that uv, vu Define words u;(0 < i < p + q) to be 
alternatingly u and v. Then assumption ( ris satisfied and consequently either u 
or v is in M. Interchanging the roles of u and v, we get that both u and v are in M. 


Figure 5.2: The condition C(p,q) (for p odd and q even). 


Let M be a submonoid satisfying a condition C(p,q). By the preceding proposition, 
M is very pure. Thus M is free. Let X be its base. By definition, X is called a 
(p, q)-limited code. A code X is limited if there exist integers p,q > 0 such that X is 
(p, q)-limited. 


PROPOSITION 2.2 Any limited code is circular. m 
EXAMPLE 2.1 The only (0,0)-limited code over A is X = A. 


EXAMPLE 2.2 A (p,0)-limited code X is prefix. Assume indeed X is (p,0)-limited. If 
p=Othen X = A. Take up = ++: = up—2 = 1. Then for any up_1, up, we have 


Up—1, Up—1Up E X* = Up © X* 


showing that X* is right unitary. Likewise, a (0, q)-limited code is suffix. 
However, a prefix code is not always limited, since it is not necessarily circular. 


EXAMPLE 2.3 The code X = a*b is (1,0)-limited. It satisfies even the stronger condi- 
tion 


wweEX > vexul. 


EXAMPLE 2.4 Let A = {a,b,c} and X = ab*cUb. The set X is a bifix code. It is 
neither (1,0)-limited nor (0, 1)-limited. However, it is (2,0)- and (0, 2)-limited. 
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EXAMPLE 2.5 Let A = {a | i > 0} and X = {ajaj41|t > 0}. The code X is circular, as 
it is easily verified. However, it is not limited. Indeed, set u; = a; for 0 < i < n. Then 
uiiu; E X for i € {1,2,...,n}, but none of the u; is in X*. 


This example shows that the converse of Proposition EH aves not hold in general. 
However it holds for finite code , AS We shall see below (Theorem 2.5). It also holds for 
recognizable codes (Exercise BS 

One of the reasons which makes the use of (p,q)-limited codes convenient, is that 
they behave well with respect to composition. In the following statement, we do not 
use the notation X = Y o Z because we do not assume that every word of Z appears 
in a word in X. 


PROPOSITION 2.3 Let Z be a code over A, let 3 : B* — A* be a coding morphism 
for Z, and let Y be a code over B. If Y is (p,q)-limited and Z is (r,t)-limited, then 
X = B(Y) is (p +r,q + t)-limited. 


AD 


Figure 5.3: X is not (p,q)-limited for p+q <3 


Proof. Let uo, u1... Up+r+q+t E A* be such that 
uiiu; E X* (<i<p+r+q+t) (5.2.2) 
Since X C Z* and Z is (r,t)-limited, it follows from (Ay that 
Ur, Ur+1,-- - , ag E Z* (5.2.3) 


3 2 
Since Y is (p, q)-limited, Birand Eor r+1 <i < p+q+r show that ur+p € X*. 
Thus X is (p + r,q + t)-limited. n 


EXAMPLE 2.6 Let A = {a,b,c,d} and X = {ba, cd, db, cdb, dba }.Then 
X = Z1 0 Z2 0 Z3 0 Z4, 
with 


Z4 = {b,c,d, ba} 
Z3 ie) Z4 {c, d, ba, db} 
Zo O Z3 O Z4 = {d, ba, db, cd, cdb}. 


The codes Z3 and Z4 are (0,1)-limited. The code 33 Za is not (0,1)-limited, but it 
is (0, 2)-limited, in agreement with Proposition Be The codes Zı and Zə are (1,0)- 
limited. Thus X ‘ge timies It is not (p, q)-limited for any (p, q) such that p+q < 3, 
as shown by Fig. i 
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Let s be an integer. A code X C A* has synchronization delay s if 


crEeXx®, u,vEe A, uve X* > uz,xv € X*. (5.2.4) 


If X has a synchronization delay s for some s, then it is said to be uniformly synchro- å 
nized. If X is uniformly synchronized, then the minimal integer s such that ( 
holds is called the minimal decihpering delay of X. 


PROPOSITION 2.4 Any uniformly synchronized code is limited. 


Proof. Let X C A* be a uniformly synchronized code, and let s he its minimal 
synchronization delay. We show that X is (2s,2s)-limited. (see Fig. 
indeed words 


. Consider 


ugo, U1, .-.,U4s E An 
and assume that for 1 < i < 4s, 
Uj—-1u; E X*. 
Set, for 1 < i < 2s, 
Ti = U2i—2U2i—1, Yi = U2i—1U2i 


Let y = y1yY2 + Ys and £ = 21 y2°-+ Ts. 


Figure 5.4: An X-factorization with s = o (X). 7_04 


Assume first that y; Æ 1 for all i = 1,...,s. Then y E€ X*°X*. Since ugyuas41 € X*, 
the uniform synchronization shows that uoy € X*. Since ugy = £u2zs, this is equivalent 


to 
Tug, € X*. (5.2.5) 


Next, consider the case that y; = 1 for some i € {1,2,--- ,s}. Then uzi-1 = ug; = 1. 
It follows that 


Yi+1 tt Ys = U2i+1 ` ' t U2s = Titi’ TtU2s 


Thus, in this case also rug, is in X*. 
Setting y’ = Ys+1***Y2s, We prove in the same manner that 


ursy’ € X* (5.2.6) 


:5 -6 
Since X* is stable, (sand (20) imply that uo, € X*. This shows that X is 
(2s, 2s)-limited. 7 
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PLE 2.7 oe the (2,2)-limited code X = {ba, cd, db, cdb, dba} of Example 
Fae o(X) = 1. Indeed, the contexts of words in X are 


= { 


Q 

os 

Q 

a 

> 
ne A 

II 
SAAS 


Thus each x € X is a constant, and therefore o(X) = 1. 


-2.4 
EXAMPLE 2.8 Let X = ab*cUb be the limited code of Example Eat tris not uniformly 
synchronized. Indeed, for all s > 0, one has b° € X* and ab’c £X . However abf, bfc ¢ 
X. This example shows that the converse of Proposition does not hold. 


We now prove that in the case of finite codes, the concepts introduced coincide. 


THEOREM 2.5 Let X be a finite code. The following conditions are equivalent. 

(i) X is circular. 

(ii) X is limited. 

(iii) X is uniformly synchronized. 
Proof We have already established the implications (iii) = (ii) = (i) without the 
finiteness assumption. Thus it remains to prove (i) > (iii). 

Let X C A” be a finite circular code, and let A} (X) = (P,1,1) be the flower 
automaton of X with the shorthand notation 1 for the state (1,1). By Proposition 
Paton element in gp(A*) has at most one fixed point. In particular, every nonzero 
idempotent in gp(A*) has rank 1. Let M = yp(A*), and let J be its 0-minimal 
ideal. Since X is finite, the monoid M is finite. Let s = Card(M — J). We show that 
ypp(w) € J for every word w of length s. Indeed, set w = aja2---ag, with a; E A, and 
set Mm; = Yp(a1a2:--a;) for i = 0,...,s. If the m; are pairwise distinct, then at least 
one of them is in J, and consequently yp(w) € J. On the contrary, there are indices 
i, j, with 0 < i < j < s, such that m; = mj. Set u = yp(ai41-:-a;). Then meu = mj. 
Let e be the idempotent which is a positive power of u. (Recall that M is finite). Then 
mje = mj. Further, e has minimal rank; thus e € J. Consequently, m; € J and also 
yp(w) € J. This proves the claim. 

We now show that X) has synchronization delay s. Indeed, let x € X® and let 
u,v € A* be such that uxv € X*. Since |z| > s, one has yp(x) € J. Thus yp(z) has 
rank 1. Consequently, there are two matrices a € {0,1}”** and 8 € {0,1}'*? such 
that yp(x) = aß. Since uzv € X*, there is a path 1 “> p “+ q > 1. Hence pat(q. 
From x € X*, one gets lat@1. It follows that pat@1 and lat3q. Therefore p —> 1 and 
1 Æ q, showing that ux, zv € X*. This shows that X is uniformly synchronized. n 


EXAMPLE 2.9 Let A = {a1,a2,...,a@2k} and 
X ={aja;|1 <i<gj < 2k} 


We show that X is uniformly synchronized and o(X) = k. First, o(X) > k since 
(a2a3)(a4a5) +» (a2k-2a2k-1) E€ X*~! and also (ajaz): -+ (a2k-1a2k) € X* and however 
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a1a2::a2k-1 É X*. Next, assume that x € x, and uxv € X*. If u and v have even 
length, then they are in X*. Therefore we assume the contrary. Then u = u’a;,v = 
aw’ with aj,a; E A and w’,v’ E€ X*. Moreover ajra; € X*. Set x = ay --- ding. 
Since x € X*, we have ių < 12,13 < i4,...,to%-1 < doz Since ajra; E X*, we have 
j <%,12 < 73,..., ton < l. Thus 1 <j <ii < tg < +++ < i2k-1 < don <l < 2k, which 
is clearly impossible. Thus u and v have even length, showing that o(X) < k . This 
proves the equality. 


«2.5 -2.10 
Compare this example with Example Ea which is merely Example Bap nith k =o. 
The infinite code was circular but not limited, hence not uniformly synchronized. 


We now give a characterization of MQ Jimited codes by means of automata. These 
codes intervene in Section FARE ee say that a deterministic automaton A = 
(Q,1,1) is an ordered automaton if Q is a partially ordered set, and if the two following 
conditions hold: 

(i) For all qE Q,q <1. 

(ii) For all p,q € Q, and a € A, 


pI<q=>pa<qa. 


PROPOSITION 2.6 Let X C At be a prefix code. The following conditions are equiva- 
lent: 

(i) X is (1,0)-limited, i.e., uv E€ X* >v e X*. 

(ii) X* is recognized by some ordered automaton. 


Proof. (i) = (ii) Let A(X*) = (Q,1,1) be the minimal automaton of X*. Define a 
partial order on Q by 
p<q iff Lp C Lq, 


where for each state p, Lp = {u € A* | p- u = 1}. This defines an order on Q, since 
by the definition of a minimal automaton, Lp = Lq © p = q. Next let q € Q, and let 
u E€ A* be such that 1 -u = q. Then v € Lọ iff uv € X*. Since X is (1,0)-limited, 
uv € X* implies v € X*, or also v € Ly. Thus Ly, C Ly, and therefore q < 1. Further, 
if p,q € Q with p < q, and a € A, let v € Lp.a. Then av € Lp, hence av € Lq, and thus 
v € Lga. This proves that A(X*) is indeed an ordered automaton for this order. 

(ii) > (i) Let A = (Q,1,1) be an ordered automaton recognizing X*. Assume that 
uv E€ X*, for some u,v € A*. Then 1-uv = 1. Since 1-u <1, we have 1l-uv <1-v. 
Thus 1 < 1-v < 1, whence 1-v = 1. Consequently v € X*. a 


EXAMPLE 2.10 Consider the automaton (Q,1,1) given in Fig. E The set Q = 
{1,2,3,4} is equipped with the partial order given by 3 < 2 < 1 and 4 < 1. For this 
order, the automaton (Q,1,1) is ordered. It recognizes the submonoid X* generated 
by 

X = (b’b*a)*{a, ba}. 


Consequently, this is a (1,0)-limited code. 


The following proposition gives another characterization of (1,0)-limited codes. 
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Figure 5.5: An ordered automaton. 7_05 


PROPOSITION 2.7 A prefix code X C At is (1,0)-limited iff the set R= A* — X A* of 
words having no prefix in X is a submonoid. 


Proof. By Theorem fear = X*R. Suppose first that X is (1,0)-limited. Let 
u,u’ € R, and set uu! = zr with x € X*, r € R. Arguing by contradiction, suppose 
that x #1. Then z is not a prefix of u. Consequently x = uv, vr = u’ for some v € A*. 
Since X is (1,0)-limited, one has v € X*; this implies that v = 1, since v is a prefix of 
u’. Thus x = u, a contradiction. Consequently x = land uu’ € R. Conversely, suppose 
that R is a submonoid. Then, being prefix-closed, R is a left unitary submonoid. Thus 
R = Y* for some suffix code Y. From the power series equation, we get 


A*=X*Y*. 


Multiplication with 1 — Y on the right gives X* = A*—A*Y. Thus X* is the 
complement of a left ideal. Consequently X* is suffix-closed. Thus X is (1, 0)-limited. 


.11 
EXAMPLE 2.11 The code X = (b?b*a)*{a,ba} of Example Et gives, for R = A* — 


X A*, the submonoid 
R={b,0 ay. 


We end this section with the definition of a family of codes which is the most restric- 
tive of the families we have examined: a code X C A? is called comma-free if: 
(i) X is bifix; 
ü) o(X)=1. 
Thus X is comma-free iff for all x € XT, u,v € A*, 


uxv € X* > u,v € X*. (5.2.7) 


Comma-free codes are those with the easiest deciphering: if in a word w € X*, some 
factor can be identified to be in X, then this factor is one term of the unique X- 
factorization of w. 


PROPOSITION 2.8 A code X C A* is comma-free iff it is (p,q)-limited for all p,q with 
p+q= 3, and if ATXATNX =Í. 


Proof. First suppose that X is comma-free. Let uo, u1, u2,Uu3 E A* be such that 
ugu, UjU2,uU2QUz; E X*. If uy = uz = 1, then uo,u3 € X*. Otherwise ujus € XT 
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and ugujueu3; € X+. Thus by GH tio, us € X*. Since X is prefix, uo, uou € X* 
implies that u, E€ X*, and X being suffix, u2u3,u3 E€ X implies that ugisinX*. Thus 
uo, U1, U2, ug E X*. Consequently, X is (p,q)-limited for all p,q > 0 with p+q= = 
Furthermore A*XA*t N X = Ø. Indeed assume that uzv,x € X. Then by eye 
u,v € X*, whence u = v = 1. Conversely, let u,v € A* and x € XT be such that 
uxv € X*. Since AtrAt N X = f, there exists a factorization x = ps, with p,s € A*, 
such that up, sv € X*. TOM) Up, ps, sv €E X* it follows, by the limitedness of X, that 
u,p,s,r E€ X*. Thus olds. n 


PROPOSITION 2.9 Let X, Z be two composable codes and let 
X=Yo¥Z. 


If Y and Z are comma-free, then X is comma-free. 


Proof. Let u,v € A* and x € XT be such that urv € X*. Since X C Z*, we have 
uzv € Z*,x C Zt. Since Z is comma-free, it follows that u,v € Z*. Sjnce Y is comma- 
free, this implies that u,v are in X*. Thus X is comma-free by ( 


EXAMPLE 2.12 Let A = {a,b} and X = {aab,bab}. The code X is bifix. Next 
C(aab) = C(bab) = {(1,1)}.Thus o(X) = 1.This shows that X is comma-free. 


3 Length distributions 


We now study the length distributions of circular codes. Let X be a circular code and 
let (Un)n>1 be its length distribution. For each n > 1, let pn be the number of words 
of length n which have a conjugate in X*. 


We set u(z) = asi unz” and p(z) = net Pnz” 


PROPOSITION 3.1 The following relation holds: 
exp) Mat =, (5.3.1) 
n 1— u(z) 


or equivalently 
es ae ae (5.3.2) 
—ulz 
where u' is the derivative of u. 


Proof. We first assume that the code X is finite. 
Let A be the flower automaton of X and let N be the adjacency matrix of the graph 
of A, that is N; ; is the number of edges from 7 to j in A. We have for each n > 0, 


Pn = TAN”), 


Indeed, Tr(N”) = i4 3nd Nj, is the number of paths of length n from i to i. In 
view of Proposition Peach word w of length n which has a conjugate in X* is the 
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label of a unique closed path in A. Conversely, each cycle contains the initial state, 


and thus its label has a conju, ate in, X *. This shows the formula. 
We now. use Proposition me By; ssigning the same symbol z to all letters in Equa- 
tion (eee matrix M of (Brae mes Nz, and X becomes u(z). Thus 
det(I — Nz) = 1 — u(z). 


Let A1,--- , Àg be the eigenvalues of the matrix N counted with their multiplicities. 
Then for each n > 1, pn = Tr(N”) = A} +... + Az. Next, from elementary calculus, 
one has, for any complex number 4, 


exe(> aa = — (5.3.3) 


n>1 


Consequently 


Mit... +X? 
exp) 2" = exp +" k yn 


n>1 n>1 4 
= (A12) (Anz)” 
T A" 4 Geel" 
n>1 
1 1 1 


1— riz 1- Az  det(I— Nz) ` 


. lel . . eo, 
This shows EST for finite codes. In the general case, one considers, for each positive 
integer m, the set of words in X of length at post m. Since each p, depends only on 
the first n terms of the sequence (un), gives the relation up to m. Since this 


) 
holds for each m ithe formula is true also for general codes. 
Formula Sa follows from er by logarithmic derivation, and conversely. In- 
deed, the equality S = T of two series with constant term 1 is equivalent to the equality 
of their logarithmic derivatives. n 


=elementaire . . 
One agp that Formula ( can be viewed as a special case of For- 


mula 


EXAMPLE 3.1 Consider the circular a à = 1g, ba} on the alphabet A = {a,b}. We 


have u(z) = z + 2? and thus by Formula 


(2) z+ 227 
2) ire 
P l1- z-z? 


ig7.1 
The automaton A is represented on Figure Eg We have 
1 1 
wali a 
and thus det(I — Mz) = 1 — z — z?. The eigenvalues of M are the two roots y, ¢ of the 
polynomial 1 — z — z? and pn = p” + ¢". 
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a 
b 
a 

Figure 5.6: The flower automaton of the circular code X = {a, ba} 


le2 
By Formula (3-9) one gets p(z) = zu’(z) + p(z)u(z), from which we obtain the 
following recurrence relation for p, which is useful for numerical computations and 
which is known as Newton’s formula (see the Notes): 


n—1 
Pa = nun + Y Primi: (5.3.4) 
i=1 


There is also a closed formula for pn. For each i > 1, let u = (unt be the length 


distribution of X*. Equivalently, u® is the coefficient of degree n of u(z)’. Then 


Pn n _ 1 _ ul) (z) 
265? ag a 
n>1 n>1 
(i) 


Thus, for each n > 1, the explicit value of the numbers pn in terms of the numbers un 


1S 
Pn = a re 
i= 


We now give a relation with primitive necklaces. Let én be the number of primitive 
necklaces of length n which meet X*. We start with a formula which is useful to 
compute the numbers £n. 


PROPOSITION 3.2 For alln > 1, 
m= Lite 635) 


d\n 


Proof. Let u be a primitive word of length d which has a conjugate in X*. Any power 
v of u has exactly d distinct conjugates and has a conjugate in X*. Conversely, if v has 
a conjugate v’ in X*, let u be the unique primitive word such that v’ is in ut. Since 
X* is pure, the word u is in X*, and thus v itself is a power if a primitive word which 
has a conjugate in X*. This shows the formula. n 


Using the Möbius inversion formula (Proposition 0.3.4), we obtain an explicit formula 


ta = ŻY uln/d)pa. 


d\n 
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The following proposition establishes a direct relationship between the sequences (un) 


and (ln). 
PROPOSITION 3.3 The following relation holds: 
1 
ae II Gaye (5.3.6) 


l-—u 
n>1 


Proof. Since, for each n, 


Pn _ yn dla 
n ’ 
d\n 
we get 
k 
Pn n ere, =. 1 
2 = Da = Le Gah 
n>1 n,k>1 n>1 


Taking the exponential of both sides, we obtain 


p 1 ; 
exp >> an = II Gye (5.3.7) 


n>1 n>1 


le1 le2bi le3 
Putting together Formulae (ERE (6:3. jj. we “Obtain Formula (TEAG m 


Equation Ee allows to compute the sequence (ln) directly from the sequence u = 
(un). We emphasize this in writing /n(u). We also write p,(u) for the corresponding 
sequence pr. 

In the special case of the sequence (k, 0,0,...), we denote Zn (k) instead of @,,((k, 0,0,...)). 
This agrees with Chapter 0 where ¢,,(k) denotes the number of primitive necklaces of 


length n on k symbols. It is clear that thg sequence (€n(k))n>1 corresponds to the code 
X = A and in this case Identity ( reads 


1 


1 
— = —— (5.3.8) | cyclotomicIdent 
1—kz Ul (1 — zm )én(k) 


7.3.3t 
It can be shown that if un < vn for all n, then n (u) < €n(v) for all n (Exercise Bey. = 


EXAMPLE 3.2 Consider again the circular code X = {a, ab} on the alphabet A = {a,b}. 
We have u(z) = z + z2? , and 
z+ 22? 
p(z) = I; z2: 
7 i le-golden 
The first values of pp and £n are given in Table 2 i 


Table 5.1: The values of p, and bn for X = {a, ab}. 
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We shall now characterize the length distributions of circular codes. 

For this, we say that a finite or infinite sequence (x;);>1 of words in At is Hall 
sequence over A if it is obtained in the following way: 

Let Xı = A. Then zı is an arbitrary word in X,. If x; and X; are defined, then set 


Xii = z} (Xi — zi), 
and x;41 is an arbitrary chosen element in X;i+ı satisfying 
[ziy] 2 [zi]. 
The sequence (X;);>1 is the sequence of codes associated with the sequence (x;)i>1- 


PROPOSITION 3.4 Let (xi)i>ı be a Hall sequence over A and let (X;)i>1 be the associ- 
ated sequence of codes. 

1. Each X;, fori > 1, is a (i — 1,0)-limited code. 

2. Each primitive word w such that |w| > |x;| has a conjugate in Xj, 1. 


Proof 1. X; = A is (0,0)-limited. Next 
Xizi =T0 Xj, 


where T is a code of the form 6*(B — b). Clearly T is (1,0)-limited. Assumi 3 aby 
induction that X; is (i — 1,0)-limited, the conclusion follows from Proposition 2.3} 

2. Define zo = 1. We prove that the claim holds for all i > 0 by induction on i. 
For i = 0, the claim just states that any primitive word is in A*. Thus assume 7 > 1, 
and let w € At be a primitive word of length |w| > |a,;|. Since |z;| > |a;_1|, one has 
|w| > |a;-1|. By the induction hypothesis, there is a word w’ conjugate of w which is 
in X*. The word w’ is not in 27 since w’ is primitive and |w’| > |x;|. Thus w’ factorizes 
into w’ = uzv for some u,v € XF and x € X; — z;. Then the conjugate w” = vuz of w' 
is in X? (X; — zi) C X},,. Thus a conjugate of w is in Xj,,. m 


THEOREM 3.5 The sequence u = (un)n>1 is the length distribution of a circular code 
over k letters if and only if €n(u) < €n(k), for alln > 1. 


Proof. Let A be an alphabet with k letters. Let X be a circular code with length 
distribution u = (un). Since @,(u) is the number of primitive necklaces of length n 
which meet X*, one has £,,(u) < €n(k). 

For the converse, we build a Hall sequence. Arguing by induction on n, we suppose 
defined an integer m = m(n) and a Hall sequence z1, ..., £m of words of length at most 
n with the sequence X1,..., Xm of associated codes and thus with Xj41 = v*(X;—2;), 
such that the length distribution of Xm coincides with the sequence u on the n first 
terms. We set for convenience Y, = Xmen): Thus, setting v; = Card(Yn N A’), one has 
vi = ui for 1 < i < n. We prove that 


Un+1 — Un+1 = Ly, l 1(k) = lng (u). (5.3.9) 


Take this equation for granted. Set r = Un+1 — Un+1. Since 0 < r we may select r 
words 2m41,---;Um+r Of length n + 1 in Y, = Xm to carry on the construction of the 
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Hall sequence for r steps. In this way, the sequence 21,...,%@m,%@m+41;---;%m+r forms 
altogether a Hall sequence. Setting m(n+1) = m-+r, the code Yn41 = Xm(n+1) satisfies 
Card(Ypi19 AŻ) = u; for 1 < i < n+1. This is clear for i < n. Next, Yp} N At! 
is obtained from Y, N A"+! by removing r words of length n +1. This finishes the 
induction, starting with Yo = A, 


A 

We now preve Equation (a Since u; = v; for i = 1,...,n, one gets_b Egua- A 
tion paperon = p;(v) for i = 1,...,n. Thus, again by Equation piton 
obtains that paalu) — Pn4i(u) = (n + 1)(Un+1 — Un41)- 

Equation F524 the equalities proved above show that @;(u) = ¢;(v) for i = 
1,...,n. This implies prii(v) — pngi(u) = (n + Lénsi(v) — €n4i(u) which in turn 
shows that €n41(v) — €n41(u) = Un41 — Un4i- a 

Since |£m| < n, the property of Hall sequences stated in Proposition Beat} shows 
that each primitive ies of length n+1 meets X%,. Thus €n41(v) = ln4i(k). This 


proves Equation (| n 


EXAMPLE 3.3 Let A = {a,b} and let u = (0,1,1,3,...). The construction of the proof 


gives 
Xı = {a, b} 
Xə = {b,ab,aab,aaab,...} 
X3 = {ab,aab, abab, aaab, baab, bbab,...} 
X4 = {ab, bab, aaab, baab, bbab,...} 


corresponding to the Hall sequence x; = a, £2 = b, x3 = aab. One gets Yı = X3 and 
Yo = Y = X4. 


: le-circular . . NN : 
We have represented in Table Ba the componentwise maximal length distributions of 
binary circular codes of length at most 4. The list is presented in decreasing lexico- 


graphic order. The last column gives a circular code h ing the indicated distribution 
constructed using the method of the proof of Theorem ah 


Table 5.2: The list of componentwise maximal length distributions of binary circular 


codes of length at most 4 


COROLLARY 3.6 Let A be an alphabet with k > 1 letters. For all m > 1, there exists a 
circular code X C A™ such that Card(X) = ,,(k). 
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Proof. Let u = (Un)ngy_be the sequence with all terms zero except for um which is 
3.9), one has ¢,(u) = 0 for 1 < n < m— 1 and 
£ 4) g Um. Thus Lalu) < n(k) for 1 <n <m. According to the proof of Theorem 

1s suffices to ensure the existence of a circular code X having um words of length 
m. Thus X N A” satisfies the claim. a 


Corollary Bra carr be formulated in the following way: It is possible to choose a system 
X of representatives of the primitive conjugacy classes of words of length m in such 
a manner that X is a circular code. The following example shows that this does not 
hold for an arbitrary set of representatives of the primitive conjugacy classes within 
the words of the same length. 


EXAMPLE 3.4 Let X be a subset of A? — {a? | a € A} and let 6 be the relation over A 
defined by afb iff ab € X. 

Then X is a circular code iff the reflexive and transitive closure 6* of 0 is an order 
relation. 

Indeed, assume first that 0 is not an order. Then 


4102, 4203,.-.,An—1An, Ana, E X 


for some n > 1, and aj,...,@, € A. If n is even, then setting u = a1,v = ag--- an, one 
has uv, vu € X* and u ¢ X*. If n is odd, then (ajaz: an)? € X* but not ayag--- Gn. 
Thus X is not circular. 

Assume conversely that 0 is an order. Then A can be ordered in such a way that 
A = {aa og atk and aja; > i < j. Then X C {aja; |i < j}, and in view of 
Example BE ae cet X is a circular code. 

This statement shows that codes like {ab, bc, ca} are not circular codes, despite the 
fact that they are composed of representatives of primitive conjugacy classes of words 
of the same length. 


Not only are the codes X, Ao in Corollary Tain but they also are uniformly 
synchronized by Theorem e may want to determine the synchronization delay 
of these codes. More precisely, we may look for the minimum delay which is realizable. 
The following result shows that for odd integers m, delay 1 is realizable. 


THEOREM 3.7 For any alphabet A with k letters and for any odd integer m > 1, there 
exists a comma-free code X C A™ such that 


Card(X) = /m(k). 


It follows from Example Biia a circular code X C A? having l(k) elements has 
the form X = {aja; | is Jhd! some numbering of the alphabet. If k is even, then 
according to Example EE e code X has synchronization delay k/2. Consequently, a 
5.7] $ not hold for even integers m. 


of associated codes by setting 
Xı = A, Xi+ı = xr (X; = zi), 1 > 1, (5.3.10) 
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where z; is an element of X; of minimal odd length. By construction, (2;);>1 is indeed 
a Hall sequence. Set 
U=VUsiXi, Y=UnN(A*)*, Z=UNA(A?) 
Thus Y is the set of words of even length in U, and 
Z={zrj|i> 1}. 
For any word u € U, we define 


min{i E€ N| ue X;}-1, 
sup{i € N | u € X;}. 


v(u) 


ô(u) 


Thus v(u) denotes the last time before u appears in some X and 6(u) is the last time 
u appears in a X;. Observe that Y = {u € U | ô&(u) = +00}. Next, note that 6(x;) =i, 
and if v(u) = q for some u € U — A, then u € Xi+q and u € Xq. Consequently u = £qv 
for some v € Xq+1ı . Further, for all u € U and n > 1, we have 


v(u) < n < d(u) > anu E U. (5.3.11) 


We shall prove by a series of lemma that for any odd integer m, the code Z N A™ 
satisfies the conclusion of Theorem B-7. ~ 


LEMMA 3.8 For all odd integers m, we have Card(Z N A™) = ,,(k). 


Proof 
Let n be the smallest integer such that |x,,| = m. Then by construction of the Hall 
sequence (2;), we have 
ZO A™ = tia} Tnt- ,Zn+p} 


for some integer p. Then Z N A™ = X, n N A”, since for all k > 1, words in Xn+k which 
are not in Xn have length strictly greater than |z,,|. Thus Card(Z N A™) = un where 
u = (un) is the length distribution of Xn. ae 

Next, by the definition of n, we have m > |£n-1|. According to Proposition ON 
each primitive word of length m has a conjugate in X*. Thus ln(u) = In(k). 

Let D be the set of odd integers d such that 1 < d < m — 2. By construction of 
the Hall sequence, we have ug = 0 for each d in D. We show by induction d that eon 
palu) = 0 for d € D. It is true for d = 1 since pı = u1 = 0. By Equation ES 
have pg = dug + So PiUd—i. Each term of the right hand side is zero since ug = 0 
and either p; =Q or uqi = 0 since i or d— iis odd. Thus pg = 0. Consequently, by 
Equation BAe. have Ig(u) = 0 for d € D and finally pm(u) = Mum and lm(u) = Um. 

We obtain in this way Card(Z N A™) = é,,(k). n 


LEMMA 3.9 Each word w € A* admits a unique factorization 


W = Y21122": * Zn (5.3.12) 


with y € Y*,zi € Z, n > 0,and 


6(z1) > 6(zg) > > 6(Zp) 
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Proof. First we show that for n > 1 
Xx, = Kits, 


Indeed, by definition Xn+1 = X} (Xn — £n) The product of x*, with Xn — £n is unam- 
biguous since X,, is a code. Thus one has in terms of formal power series 
Consequently, X ny1 = 27 Xn- gx 


Anwi—l=s7Xn— 2, =z, (An 1); 


Snn =n n\— 
18 
Formula (E519 tollows by inver IM. 4g l l 
By successive substitutions in ( , starting with A* = Xf, one gets for all n > 1 
A* = X gh tg te (5.3.14) 


Now let w € A* and set p = ul. Let n be an integer such that Xn+1ı contains no word 
of odd length < p. By here exists a factorization of w as 


W = YZ 22°°' Zk 


with 6(z1) > d(zg) > ++: > (zk), zi E Z and y E€ X71. Since |y| < p, the choice of 
n implies that y is a product of words in Xn4+1 ofe ep length. Consequently y € Y*. 
This proves the existence of one factorization ( . Assume that there is second 
factorization of the same type, say, 


a hh sh / 
W = Y 24220 °°* Zn 


Let m be an integer greater than ô(z1) and ô(z21), and large enough to ensure y, y’ € 
Xm+1: Such a choice is possible since all gvgn, words of some code X; are also in the 
codes Xy, for l’ > 1. Then according to ( , both factorizations of w are the same. 

| 


. . mai 17 
Now, we characterize successively the form of the factorization PSTD for words 
which are prefixes and for words which are suffixes of words in U. 


. . Š . 7 . 
LEMMA 3.10 Each proper prefix w of a word in U admits a factorization (ESIA) with 
y= 


Proof. Each of the co ig ê maximal prefix code. This follows by iterated appli- 
cation of Proposition onsequently for n > 0, 


A*=XiuPay 


where P,41 = X, g ÍS the set of proper prefixes of words of Xn+1. Comparing this 
equation with ( ae get 


Pny =Ln£n-1°°'£1- (5.3.15) 


Let now w be a proper prefix of some wor. 4h in U. Then u € Xn41 for some n > 0 
and consequently w € Pnii. By Eq. ( , w admits a factorization of the desired 
form. n 
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LEMMA 3.11 For all n,p > 1, we have tnInpp € Y*. Further forz € Z andy €Y, 
we have zy € Y*Z. 


Proof. The first formula is shown by induction ° : ie p = 1, we have v(tn41) <n 
since £n+1 E Xn+1. Thus according to formula , we have £n£n+1 € U. Since 
LnXLn+1 has even length, tnensicy. 

Assume that the property holds up to p= and set q = V(£n+p). We distinguish 
two cases. First assume q < n. Then by c iV, with £n+p playing the role of u, we 
have £nť£n+p € U. This word has even length. Thus £n£n+p € Y. 

Next suppose that n < q. Then £np E U — A. Consequently tn4) = xgu for 
some u € U. Since q < n+p (= ô(Xn+p)), we have tnx, € Y* by the induction 
hypothesis. Next u has even length (because |x,,|,|vq| are both odd). Thus u € Y, 
whence @ylnip € Y . 

Let us prove the second formula. Set n = 6(Z) and q = v(y). Then z = £n 
and y = x gx; for some t. If n < q, then £nz4 E€ Y* by the preceding argu ts 
and consequently zy € Y*Z. On the contrary, assume q < n. Then by eae 
InLg@t = Tny E U. Since it has odd length this word is in Z. m 


T 
LEMMA 3.12 Any suffix w of a word in U admits a factorization (SID with n=0 


orn=l1. 


Proof. Given a word u € U, we prove that all its suffixes are in Y*Z U Y*, by induction 
on |u|. The case |u| = 1 is clear, and clearly it suffices to prove the claim for proper 
suffixes of words in U. 

Assume |u| > 2. Set n = v(u). Since u € U — A, we have u = zu’ for some wu’ € U. 

Let w be a proper right factor ot u. If w is a suffix of u’, then by the induction 
hypothesis, w is in Y*Z U Y*. Thus we assume that w = w’u’, with w’ a proper suffix 
of x, . By induction, w’ is in Y*Z U Y*. If w € Y*, then wu’ € Y*(Y U Z) and the 
claim is proved. Thus it remains the case where w’ € Y*Z. In this case, ee = YTk 
with y € Y*, k > 1. Observe that k < n since |z| < |w’| < |x| (see Fig. pe 


u: © O O 
w (0; wv ò u O 
w a a 
Figure 5.7: 


3.12 
We now distinguish two cases: First, assume u’ € Y. Then by Lemma Boop eeu’ € 
Y*Z. Consequently, w = yx,u' € Y*Z. Second, suppose that u’ € Z. Then u! = £m 
for some m, We haye £m € X}, implying that m > n. Since k < n, we have k < m and 
by Lemma , Tk£m € Y*. Thus w = YTk£m € Y*. This concludes the proof. n 


-3.8 
Proof of Theorem Eite m be an odd integer and let X = ZN A™. Let x, x’, x" € X. 
Assume that for some u,v € AF, 


rz = ua" (5.3.16) 
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Then for some w,t € A*, we have z = uw,x” = wt, x' = tv. Since x” has odd length, 


one of the wor y and t must have even length. Assume that the length of wi ii 
even (see Fig. . Since w is a proper prefix of 2” € Z, we have by Lemma pe 


a factorization W = 229°: Zn with 2, 22,...,2n € Z and (z1) > --- > (Za) Qg 
the other hand, the word w is a suffix of x € Z, and according to Lemma we 
have w € Y*Z U Y*. Since w has even length, w € Y* Thus n = 0 and w = 1, a 
contradiction. . 
A a 
u Ww t Vv 
x 
Figure 5.8: 7_07 


EXAMPLE 3.5 Let A = {a,b}. A sequence (#,)n>1 satisfying the conditions of the 


construction given above is 


Xi = {a, b}, 
Xə = {b, ab, ab, a?b,atb,...}, 
X3 = {ab, ab, a?b, afb,... 

bab, ba?b, ba®b,... 


b?ab, b?a2b,... 
bab,...}, 
X4 = {ab, bab, ab, a‘b,... 
bab, bab,... 
b?ab, b?a?b,... 
b’ab,... 
a?bab,...}, 
Xs = {ab, ab, atb,... 
ba?b, baðb,... 
bab, b2a7b,... 
b’ab,... 
a?bab,... 
babab,...}. 


We have represented only words of length at most five. Words of the same length are 
written in a column. Taking the words of length five in X5, we obtain all words of 
length five in the code Z. Thus the following is a comma-free code X C A3: 


X = {a‘b, bab, b?a7b, b° ab, a”bab, babab}. 


It has Card(X) = ¢(5) = ements. The words of length three in X3 give the 
comma-free code of Example 
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4 Exercises 


. ction7.1 
Section 


1.1 Show that the submonoid {ab, ba}* is pure. 


1.2 (Fine-Wilf Theorem). Show that if three words x,y,z satisfy cy" = zP with 
m,n,p > 2, then the three words xz, y, z belong to the same cyclic submonoid t*. 

(Hint: first prove that the conclusion holds if p > 3 considering the conjugate z’ of 
z which is a Lyndon word. Then solve the case p = 2 using the fact that for some 
conjugate x’ of x, the equality x’ = uy” holds for some u). 


1.3 Let X = {x,y} be a code with two elements. Show that if X* is not pure, then 
the set x*y U y*x contains a word which is not primitive. 

Hint: consider the least integer i > 1 such that w? € X*ay'xX*. Replacing w be 
an X-conjugate, suppose that y’x is a prefix of w and x a suffix of w. Let w’ be an 
X-conjugate of w such that wh = hw’ and with h shorter than the word z € X such 
that w’ € X*z. Distinguish three cases: (1) w € yX*z, (2) w € eX*z and |hz| > yf, 
(3) w € X*y and |ha| > |y’|. ). 


: 7.1.2 f are 
1.4 Deduce from Exercise Eg that if « = uv and y = vu are conjugate primitive 
words, then X* = {z, y}* is pure. 


. section7.2 
Section 


2.1 Let X C A? be a finite code and let A = (Q,1,1) be an unambiguous trim 
automaton recognizing X*. Let y be the associated representation. Show that X* is 
pure iff the monoid y(A*) contains no nontrivial groups. 


2.2 Let A be a finite alphabet and let X C AT be a finite code. Show that X is 
circular iff there exist finite subsets T,U,V,W of At such that 


X* = TU (UA* N A*V) — AW A". 


2.6 
(Hint: Use Theorem Z9 


2.3 A set X C A* is called (p,q)-constrained for some p,q > 0 if for each sequence 
Uo, U1,- -- , Up+q Of words the condition uj;_ju; E€ X for 1 < i < p + q implies up € X*. 
(a) Show that, for p+ q < 2, a set X is (p,q)-constrained iff it is (p, q)-limited. 

(b) Let A = {a,b} and X = {a,ab}. Show that X is (3,0)-constrained but not 
(3, 0)-limited. 


2.4 Let X C At be a maximal prefix code. Show that the following conditions are 
equivalent. 
(i) o(X) =1, 
(ii) A*X c X*, 
(iii) X is a semaphore code such that S = X — AX satisfies SA* N A*S = 
SUSA*S. 


2.5 Show that a recognizable code is limited iff it is circular. 
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i section7.3 
Section 


3.1 Show that the set Ln of Lyndon words of length n over a k letter alphabet is a 
circular code. Show that Ln is comma-free iff n = 1 or n = 2, k < 3 or (n = 3,4 and 
k > 2). 


3.2 Let A bea k letter alphabet and let s € At be a word of length p. Let R be the 
finite set 
R = {w € A* | sw € A*s, |w| < p}. 


Let X be the semaphore code X = A*s — A*sA*. For all T C A*, we denote by ur 
the series 
ur = pJ Card(T N A™)t”. 


n>0 


5 
Using Proposition Hea iow that 


UT 


tP 
~ P+(1-— kt)ur 


Now let Z = (sA*t N Ats) — AtsA*. Show that s+ AX =X +Z. Let U = Zs™!. 
Show that for all n > p, the code U N A” is comma-free and that the generating series 
of the length distribution of U is 


(kt — 1) 


= ——_____ +] 
tP + (1—kt)ur = 


uy 


3.3 Show that for an sequence (Un)n>1 Of nonnegative integers, the sequence pp 
defined by Formula ( 1s formed of nonnegative integers. 


3.4 Let (Un)n>1 be a sequence of nonnegative integers. Let A be a weighted alphabet 
with u, letters of weight n for each n > 1. The weight of a word is the sum of the 
weigths of its letters. 

Show that @,,(u) is the number of primitive necklaces on the alphabet A with weight n. 


3.5 Let (Un)n>1 and (Un)n>1 be two sequences of integers such that 0 < Ung Yn for 
each n > 1. Show that @,(u) < ¢,(v) for all n > 0 (hint: use Exercise Bea 


3.6 For any sequence (Vn)n>1 of complex numbers, define the sequence (pn) by 


Dn = >. dun! 4. 


d\n 


Show that, in terms of generating series, one has 


exp X` Taar = Jc — unz”)! 


n>1 n>1 
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. . . . lei 
3.7 Show that the coefficient of z” in the series of Equation (EREE equal to the 
C of multisets of primitive necklaces meeting X* whose total degree (that į the. a 
sum of the lengths of the necklaces) is n. Give two proofs, one using Equation (a 


the other by applyi alae free monoid X* the property of complete factorizations 
given in Corollary , using the fact that X* is a very pure submonoid. 


ittVectors 
3.8 Take the notations of Exercise Ea with p, as at the beginning of Section 4. 


Show that „the py are nonnegative integers. Hint: they are already, int, as, using 
Equation . By iteration of the fundamental bisection of Example ee the 
existence of codes X,, and Cn, defined by: X, = X, Cn = {x E€ Xn,| £ |= n}, Xn = 
(Xn — Cn)C% such that the free monoid X* has the factorization X* = C{C3...C7_X}41- 
Show that vn is the cardinality of Cy. 


3.9 A language L is called cyclic if (i) for any words u,v, uv € L if and only if vu € L, 
and (ii) for any word w and positive integer n, w € L if and only if w” Ef The zeta 
function of a language is given by the left-hand side of Equation E i), where pp is 
the number of words of length n in L. 

Show that if X is a circular code, then the closure under conjugation of X* is a cyclic 
language. Show that the latter is rational if the former is. Show that its zeta function 
is equal to the generating function of X*. Show that more generally, the zeta fu ction a 
of a cyclic languge L has the expansion given in the right-hand side of Equation (ae 
where lp denotes the number of primitive necklaces of length contained in L. Deduce 
that it has therefore natural integer coefficients. 


5 Notes 
The definition of limited codes is from Schiitzenberger (1965c), where limited codes 


are defined by some condition U/,(p,q) for p < 0 < q which is our condition C(—p, q). 
The notions of synchronization =a GO) and of orn Be ynehronized code have aa 


introduced in a 965). Theorem is from [De Luca and Restivo 


(1980). Theorem 2 Eis in Sey DA See also [Lasse (1976) where the term 


“circular code” is used. 


There is a close connection between the formulas concerning the length distributions 


of circular codes and the field of symmetric functions (see ({L995))) . Actually, 
the numbers un are, up to the sign, the elementary symmetric functions of the roots 
ra 1S 


of the polynomial 1 — u(z) and the pn are the sums of powers. Formula ( 
well-known in this context E Formula ears is known as Newton’s formula. 


The left side of Formula ( is often called a zeta function. In the context of 
symbolic dynamics, the zeta function of a subshift S is defined as 


z) = exp > Payr 


n>1 


where pn is the number of points of period n (see (11995). This 
corresponds to our hypotheses, considering the subshift formed = al infinite words 


having a factorization in words of X. In this context, Formula ( 15 a particular 
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case of a result of Manning which is the following. Let S be the subshift formed of all 
two-sided infinite paths in a graph G. Let M be the adjacency matrix of G. Then 


1 
6s(2) = det(I — Mz)’ 
tomicIdent 
The numbers @,,(k) are called the Witt numbers and Identity (xa is called the cy- 


clotomic identity. , 
Th-lêngthdistrib 


Theorem .5 1s due to P (1965c). The proof uses a method known in 


the context of free Lie algebras as Lazard elimination method. 


ec 

The pair eee defined as in Exercise is called a Witt vector (see g) (965) 
or Metropolis and Rota (1983p), tede link between Witt vectors and codes ‘and the 
construction given in Exercise B.0 is due to 

Cyclic languages and their zeta functions were introduced in 
(1990). It is shown there = the zeta function of a yaşignal se language is a rational 
function (see also [Béal et al] Béal et al] (1996)). Exercise Paes that this is true if the cyclic 
language is the closure alee conjugation of a rational circular code. In 
(1997), it is shown that each rational cyclic language is the disjoint union of the closure 
under conjugation of rational very pure monoids; this implies that the zeta function is 
N-rational. 


The story of comma-free codes is a pleasant one. They were introduced in 
(L958). Some people thought at that time that the biological code is comma- 
free (Crick’s hypothesis). The number of amino acids appearing in proteins is 20. 
They are coded by words of length three over the alphabet of bases A,C,G,U. Now, 
the number ¢3(4) which is the maximum number of elements in a comma-free (or 
circular) code composed of words of length three over a four-letter alphabet is precisely 
20. Unfortunately for mathematics, it appeared several years later with the work of 
Niernberg that the biological code is not even a code in a sense a this book. Several 
triples of bases may encode the same acid (see Yeas} ({L969 or Btryer] (L975)) . This 
disappointment flges not weaken the interest of circular codes, we believe. 
Theorem Tas been conjectured by Golomb et al. and proved by | (L965). 
A simple construction has been given by Scholtz] (L969), on which the proof given here 
is based. Other constructions which are possible are described in 
(1981). For even length, no formula is known giving the maximal number of elements 
of a on code e (See Piggs eee 
the Solutions follows RI T.4) are from [Lentin] 
Do . Exercises a (L974) (see also 
fHashiguchi and Honda (L976b)). These statements have a natural place within the 
AA of the theory of varieties of monoids (see Eilenberg] (L976) or Pin] (L984)). 


Exercise may be formulated in a more general way. For this a subset X of AT 
is called aperiodic if its syntactic monoid contains no nontrivial subgroup. Given an 
aperiodic code X, the monoid X* is pure iff X* is aperiodic (see (1977) and 
((L979)). 


7.3.2 
Exercise Be 1s from |Guibas and Odlyzkg ({1978})). The codes introduced in this exercise 
were defined by (L960) and named prefiz-synchronized. Gilbert has conjectured 


eaa ad See a Re prpof given i in 
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that UM A” has maximal size when the word s is chosen unbordered and of length 
log; n. 


This conjecture has been settled by (Guibas and OdlyzkoJ (L978). It holds for k = 


2,3,4, but is false for k > 5. 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


262 V. Circular Codes 


Version 22 janvier 2008 J. Berstel, D. Perrin and C. Reutenauer 


Chapter VI 


Automata 


0 Introduction 


In Chapters Mandi we studied special families of codes. We now return to the study 
of general codes. In the present chapter, we introduce a new technique: unambiguous 
automata. The main idea is to replace computations on words by computations on 
paths labeled by words. This is a technique which is well known in formal language 
theory. It will be used here in a special form related to the characteristic property of 
codes. 

Within this frame, the main fact is the equivalence between codes and unambigu- 
ous automata. The uniqueness of paths in unambiguous automata corresponds to the 
uniqueness of factorizations for a code. Unambiguous automata appear to be a gener- 
alization of deterministic automata in the same manner as the notion of a code extends 
the notion of a prefix code. 

To each unambiguous automaton corresponds a monoid of relations which is also 
called unambiguous. A relation in this monoid corresponds to each word and the 
computations on words are replaced by computations on relations. 

The principal result of this chapter (Theorem j shows that very thin codes are 
exactly the codes for which the associated monoid satisfies a finiteness condition: it 
contains relations of finite rank. This result explains why thin codes constitute a natural 
family containing the recognizable codes. It makes it possible to prove properties of thin 
codes by reasoning in finite structures. As a consequence, we shall give, for example, an 
alternative proof of the maximality of thin complete codes which does not use measures. 
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The main result also allows us to define, for each thin code, some important pa- 
rameters: the degree and the group of the code. The group of a thin code is a finite 
permutation group. The degree of the code is the number of elements on which this 
group acts. These parameters reflect properties of words by ans of “interpretations” . 
For example, the synchronous codes in the sense of Chapter ee ose having degree 
1. 


One of the applications of the notions introduced in this chapter is the oof of, the 
theorem on the synchronization of semaphore codes announced in Chapter Fi direct 
combinatorial proof of this result would certainly be extremely difficult. 


This chapter is organized in the following manner. In the first section, the notion of 
unambiguous automaton is introduced. The basic re ult on the correspondence between 
codes and unambiguous automata is Proposition E l 


In the second section, we study various types of unambiguous automata which can be 
associated with a code. In particular, the flower automaton is defined and it is shown 
that this is some kind of universal automaton in the sense that any unambiguous au- 
tomaton associated with a code can be obtained by a reduction of the flower automaton 
of this code. We also show how to decompose the flower automaton of the composition 
of two codes. 


In Section e properties of unambiguous monoids of relations are proved. These 
monoids constantly appear in the sequel, since each unambiguous automaton gives rise 
to an unambiguous monoid of relations. In particular, we define two representations of 
unambiguous monoids of relations, called the R and £L-representations. These repre- 
sentations are relative to a fixed idempotent chosen in the monoid, and they describe 
the way the elements of the monoid act by right or left multiplication on the R-class 
and the £-class of the idempotent. 


ction4.4 
The notion of rank of a relation is defined in Section BF fhe most important result 
in this section states that the minimal ideal of an unambiguous monoid of relatio is, 5 
formed of the relations having minimal rank, provided that rank is finite (Theorem PE 


Moreover, in this case the minimal ideal has a well-organized structure. 


In Section B ke retain to codes. We define the notion of a very thin code which is 
a refinement of the notion of thin code. The two notions coincide for a complete code. 
Then we prove the fundamental theorem: A code X is very thin iff the associate Re 
ambiguous monoid of relations contains elements of finite positive rank (Theorem 


Several consequences of this result on the structure of codes are given. 


ction4.6 
Section Bj contains the definition of the group and the degree of a code. The definition 
is given through the flower automaton, and then it is shown that it is independent of 
the automaton considered. We also show how the degree may be expressed in terms 
of interpretations of words. Then synchronous codes are defined, and it is shown that 
these are the codes with degree 1. 


In Section 7, we iye, the proof of the theorem on the syn ronization hore 
codes (Theorem which has been mentioned in Chapter eorem 7 
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1 Flower automaton 


We describe in this section the construction of a “universal” automaton recognizing a 
submonoid of A”. 
Let X be an arbitrary subset of At. We define an automaton 


Ap(X) = (Q,1,T) 


by 
Q = {(u,v) € A* x A* | uv € X}, 


T=1xX, T=xX x1 


with edges 


(x) ay O ae et € an GD 


Figure 6.1: The edges of Ap(X) for x = ajaz... an. 
iff ua = u’ and v = av’. In other words, the edges of Ap are 
(u,av) > (ua, v), uav E X. 


It is equivalent to say that the set of edges of the automaton Ap is the disjoint union 
of the sets of edges given by Fig. or each © = a1a2...an in X. The automaton 
Ap(X) is unambiguous and recognizes X, i.e., 


|Ap(X)| =X. 


The flower automaton of X is by definition the automato blaat RA * obța ng 

Ap(X) by applying the construction described in Section Feet ormu ae ) REI E 
It is denoted by A(X) rather than (Ap(X))*. We den te hy „gp the associated 
representation. Thus, following the construction of Section Feet maton A(X) 


is obtained in two steps as follows. Starting with Ap(X), we add a new state w, and 
the edges 


w — (a,v) for av € X, 
(u,a) “Sw for ua € X, 
a 
w —W fora Ex. 


This automaton is now trimmed. The states in 1 x X and X x 1 are no longer accessible 
or coaccessible and consequently disappear. Usually, the state w is denoted by (1,1). 
Then Ah (X) takes the form 


Ap(X) = (P, (1,1), (1, 1)) 


with 
P = { (u,v) € At x At | w € X} U {(1,1)}, 
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and there are four types of edges 


(u,av) —> (ua,v) for uavexX, (u,v) 4 (1,1), 
(1,1) = (a,v) for ave X, v1, 
(u,a) = (1,1) for uacX, u#l, 
(14,1) = (1,1) for aeX. 


The terminology is inspired by the graphical representation of this automaton. Indeed 
each word x € X defines a simple path 


(1,1) > (1,1) 
in Ap(X). If z =a E A, it is the edge 

(1,1) 5 (1,1). 
If x = ajaQ--- ay with n > 2, it is the path 


(1,1) = (a1, 49°++ Gn) => (a1a2,03 an) 4 +++ = (1,1). 


T7 
EXAMPLE 1.1 Let X = {aa, ba, bb, baa, bba}. The flower automaton is given in Fig. we 


Figure 6.2: The flower automaton of X. 


THEOREM 1.1 Let X be a subset of At. The following conditions are equivalent: 
(i) X is a code. 
(ii) For any unambiguous automaton A recognizing X, the automaton A* is un- 
ambiguous. 
(iii) The flower automaton A(X) is unambiguous. 
(iv) There exists an unambiguous automaton A = (Q,1,1) recognizing X* and 
X is the minimal set of generators of X*. 
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Proof. (i) = (ii) is Proposition EI The implication (ii) = (iii) is clear. To prove 
(iii) = (iv) it suffices to show that X is the minimal generating set of X*. Assume 
the contrary, and let x € X,y,z € X+ be words such that x = yz. Then there exists 
in A$ (X) a simple path (1,1) = (1,1) and a path (1,1) => (1,1) —> (1,1) which 
is also cue a a. These paths are distinct, so A p(X) is ambiguous. (iv) = (i): by 
Proposition #4, is free; thus X is a code. n 


We shall now describe explicitly the paths in the flower automaton of a code. 


PROPOSITION 1.2 Let X C At be a code. The following conditions are equivalent for 
all words w € A* and all states (u,v), (u’,v’) in the automaton Ah (X): 
(i) There exists in A(X) a path 


c: (u,v) = (u’,v’) 


(ii) w € vX*u' or (uw =u’ andv=wv' ), 
(iii) uw € X*u’ and wv’ € vX*. 


Proof. (i) = (ii) If c ig a simple path, then it is a path in Ap. Consequently, uw = u’ 
and v = wv’ (Fig. a)). Otherwise c decomposes into 
c: (w0) > (11) 5 (1,1) = wr) 


with w = vzu’ and x € X* (Fig. Beeb). 

(it) > (iii) If w € vX*u’, then uw € uvX*u’ C X*u’ and w € vX*u'v’ C vX*, since 
uv, uv € X U1. If uw = u’ and v = wv’, then the formulas are clear. 

(iii) => (i) By hypothesis, there exist x,y € X* such that 


Let z = uwv’. Then 


Fat 
z = uwv = ruv =uvy € X*. 


Each of these three factorizations determines a path in Ap (X) 


(a) 


Figure 6.3: Paths in the flower automaton. 


c: (1,1) —> (@,5) > (w'y) = (1,1), 
d:,1) A, H, (u,v) ma 1,1); 
ds) > (u,v) > ,1) > (1,1), 
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(The paths (1,1) = (u,v) — (1,1) and (1,1) 2; (u', v") 2, (1,1) may have 
length 0.) Since X is a code, the automaton A} (X) is unambiguous and consequently 
c= = c". We obtain that (u,v) = (@,0) and (u’,v’) = (a, 0’). Thus 


(u,v) > (u’,v’). 


The flower automaton of a code has “many” states. In particular, the flower automaton 
of an infinite code is infinite, even though there exist finite unambiguous automata rec- 
ognizing X* when the code X is recognizable. We shall show that in some sense which 


will be made more precise now, A(X) is universal among the automata recognizing 
X* 
Consider two unambiguous automata 


ASIP R1) and B = (Q,1,1) 
and their associated representations y4 and yg. A function 
p:P>Q 
is a reduction of A onto B if it is surjective, p(1) = 1 and if, for all w € A*, 


(q, yglw), q')=1 


iff there exist p, p' € P with 


(p, pa(w), p) =1,  pp)=q4, pp)=d. 


The definition means that if p > p' is a path in A, then p(p) —> p(y’) is a path in B. 
Conversely, a path q > q' can be “lifted” in some path p => p' with p € p-'(q),p' € 
p*(q). 

Another way to see the definition is the following. The matrix yg(w) can be obtained 
from y,4(w) by partitioning the latter into blocks indexed by a pair of classes of the 
equivalence defined by p, and then by replacing null blocks by 0, and nonnull blocks by 
1. 

Observe that if p is a reduction of A onto B, then for all w,w’ € A*, the following 
implication holds: 


pa(w) = pa(w') = yp(w) = p(w’) 
Thus there exists a unique surjective morphism 
Pp: palA*) > gp(A*) 
such that 
pB = ĤO PA 


The morphism ô is called the morphism associated with the reduction p. 
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PROPOSITION 1.3 Let A = (P,1,1) and B = (Q,1,1) be two unambiguous trim au- 
tomata. Then there exists at most one reduction of A onto B. Ifp: P= Q isa 
reduction, then 

1. |A| c |B], 

2. |A| = |B| iff p11) =1. 
Proof. Let p, p' : P —> Q be two reductions of A onto B. Let p € P, and let q = p(p), 


q! = p' (p). Let u,v € A* be words such that 1 “> p —> 1 in the automaton A. Then 
we have, in the automaton B, the paths 


1> q1, 15> qg =>1. 
Since B is unambiguous, q = q’. Thus p = p’. 
1. If w € |A], there exists a path 1 > 1 in A; thus there is a path 1 => 1 in B. 
Consequently w € |B]. 
2. Let w € |B|. Then there is a path p > p' in A with p(p) = p(p') = 1. If 
1 = p7}(1), then this is a successful path in A and w € |A|. Conversely, let p 4 1. Let 


1 — p — 1 be a simple path in A. Then wv € X, where X is the base of |A|. Now in 
B, we have 1 => p(p) > 1. Since |A| = |B|, we have p(p) Æ 1. Thus p7} (1) = 1. m 


PROPOSITION 1.4 Let X C At be a code, and let A} (X) be its flower automaton. 
For each unambiguous trim automaton A recognizing X*, there exists a reduction of 


A H(X) onto A. 
Proof. Let A(X) = (P, (1,1), (1,1)) and A = (Q, 1,1). Define a function 
pP: P —> Q 


as follows. Let p = (u,v) € P. If p= (1,1), then set p(p) = 1. Otherwise uv € X, and 
there exists a unique path 


c:1 =q => 1 in A. 


Then set p(p) = q. 
The function p is surjective. Let indeed q E€ Q, q 41. Let 


a :1— q, c&:q—1 


be two simple paths in A. Then w € X, and p = (u,v) € P satisfies p(p) = q. 
We now verify that p is a reduction. For this, assume first that for a word w € A*, 
and q,q' € Q, 
(q,¢a(w),q') = 1. 


Consider two simple paths in A, e : 1 —> q, e' : q! —> 1. Then in A, there is a path 


r 


1 “sq Z, d tyi 


Consequently uwv’ € X*. Thus for some x; € X, uwv’ = z112: Xp. Since e is simple, 
u is a prefix of xı, and similarly v’ is a suffix of £n. Setting 


ry = uv, CSU 
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we have 
1 Jos 
UWV = UVTI ``: In = T1: Ln_1Uv, 


whence 
/ 
uw € X*u’, wo’ € ux". 


22 
In view of Proposition EFt, v), pp(w), (u’,v’)) = 1. 


Suppose now conversely that 


(p, pp(w),p') = 1 (6.1.1) 


for some p = (u,v),p’ = (u’,v’), and w € A*. Let q = p(p), q! = p(p’). By construction, 
there are in A paths 


1=>q >11 ad 15g 41 (6.1.2) [eq4.2.2] 
2.2 at 
In view of Proposition ormula (this equivalent to 
{uw =u! and v = wv} or {w = vru' for some x € X*} 


2 
In the first case, uv = uwv’ = u'v'. . Thus the two paths (BH oincide, giving the 
path in A, 


1 “q Z, d "sd. 


In the second case, there is in A a path 


Thus, (q, y.4(w), q’) = 1 in both cases. T 


EXA RLE 1.2 For the code X = {aa, ba, bb, baa, bba}, the flower automaton is given in 
Fig. ; 


Figure 6.4: The flower automaton of X. 4_09 
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0 
Consider the automaton given in Fig. ba The function p : P —> {1,2,3} is given by 


p((a,a)) = p((ba,a)) = p((bb,a)) = 2, 
p((b,a)) = p((b,b)) = p((b, aa)) = p((b, ba)) = 3, 
1 = 1. 


= 
t 
7 


, 


p(( 
The matyi gs of th associated representations (with the states numbered as indicated 
in Figs. and are 


Figure 6.5: Another automaton recognizing X*. 4_10 


1ı [0 1 000000 
2[roo ooo 
3/100 00 00 0 
Oa ee eee eed osha ae 
5110000000 eaa 
6/0 0000000 
ESEE EETI 
s |0 0000000 
LPO 0 00 1 1 11 
soo ooog oo] 
3100 00 00 0 0 a ie 4 
pa- 4 {990900007 v(t) =| 0 0 
5100000000 aia 
61100000001 
ele a al 
s |0 00 10 0 0 0 


The concept of a reduction makes it possible to indicate a relation between the flower 
automata of a composed code and those of its components. 


PROPOSITION 1.5 Let Y C Bt, Z C A be two composable codes and let X =Y 0g Z. 
IfY is complete, then there exists a reduction of A} (X) onto Ah(Z). Moreover, Ah (Y) 
can be identified, through B with the restriction of Ap(X) to the states in Z* x Z*. 


Proof. Let P and S be the sets of states of A7,(X) and A (Z) respectively, and let yx 
and yz be the representations associated to A} (X) and A} (Z). 
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We define the function 
p:P>S5S 


as follows. First, let p((1,1)) = (1,1). Next, consider (u,v) € P—(1,1). Then wv € Z+. 
Consequently, there exist unique z, Z € Z*, and (r,s) € S such that 


(see Fig. bray. Then let p(u, v) = (r,s). The function p is surjective. Indeed, each word 
in Z appears in at least one word in X; thus each state in S' is reached in a refinement 
of a state in P. 

To show that p is a reduction, suppose that 


((u, v), yx (w), (u’, v’)) =1. 
Let (r,s) = p((u,v)), (r’, 5’) = p((u’,v’)), and let z, Z, 2’, 2’ € Z* be such that 


u 


v 


Figure 6.6: Decomposing a petal. 4_ 


U = 27, UV = 8Z, u = zr’, v = sz’ 
2.2 
By Proposition i A uw € X*u’, wo! € vX*. Thus zrw € Z*r’, ws'2Z’ € sZ*, implying 
that zrws’ € Z* and rws'z € Z*. This in turn shows, in view of the stability of Z*, 
that rws’ € Z*. Set zrw = ĉr’, with 2 € Z*. Then 


and each of the four factors in this equation is in Z*. Thus Z being a code, either 2 = zt 
or z = 2t for some t € Z*. In the first case, we get tr's! = rws', whence rw € Z*r’. The 
second case implies r’s’ = trws’. Since r's’ € 1U Z, this forces t = 1 or rws’ = 1. ts 
both cases, rw € Z*r’. Thus rw € Z*r’, and similarly ws’ € sZ*. By Proposition F> 


((r, s), yz(w), (r’, s’)) =l 


Assume conversely that 
((r, 8), pz(w), (r,s) =1 


2.2 
Then by Proposition f 3 
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for some z,z’ € Z*. Then rws’ € Z*, and Y being complete, there exist t,t’ € Z* such 
that m = trws't! € X*. Let 


m = trwst = trsz't = tzr's't = £1- £n 


with n > 1, £1,...,£n E€ X. We may assume that t and t’ have been chosen of minimal 
length, so that t is a proper prefix of xı and t is a proper suffix of z,. But then, sjn 
m € Z* and also trs € Z*, trs is a prefix of xı and r’s’t’ is a suffix of x, (Fig. 
Define 

£1 = Uv with u = tr, v E€ sZ* 

Tn = u" with u’ = t'r’, v E€ s'Z* 


Then (u,v) and (u’,v’) are states of A} (X), and moreover 


7 u a: ou v > 
hehe. ee A a. ee 
tg EO SG Lo awe gt Se 
fa Ne OE E w feet Ne N 
i 7 ‘~ O 
xX Xn 
(a) 

o u y o 
Q V 9 V Q 
MEE . fone. een ME i 

i t riS ars t 5 
1 ` , 1 1 
m: 6 Y Po O a r a i } 


and 


/ (peed 
m = UWY = Ut = Ey = T nU N 


Thus 
uw € Xu’ and wv € vX*. 


Finally, consider the set R of states of Ap(Y). Then R can be identified with 
R' = {(u,v) € P | u,v € Z*}. 


The edges of A7,(Y) correspond to those paths (u,v) > (u’,v’) of A p(X) with end- 
points in R’, and with label in Z. " 


hapteri 
EXAMPLE 1.3 Recall from Chapter that the code X = {aa, ba, bb, baa, bba} is a com- 
position of Y = fcc, d,e,dc,ec} and Z = {a, ba, bb}. The flower automaton A} (X) is 
given in Fig. i 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


4_ 


12 


274 VI. Automata 


4 
The flower automaton A} (Z) is given in Fig. ea It is obtained from A} (X) by the 
reduction 


( 

(6) = (8) = 
p(5) = p(7) = 7. 

The flower automaton A} (Y) is given in Fig. Bara 


Figure 6.8: The flower automaton of X. 4_13 


(ba) 8 


(bo)? 
Figure 6.9: The flower automaton of Z. 4_14 


Let X C At be a code and let 3 : B* — A* be a coding morphism for X. Since £ is 
injective, there exists a partial function, 


y: A* > B* 


with domain X* and such that y(8(u)) = u for all u € B*. We say that y is a decoding 
function for X. 
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Figure 6.10: The flower automaton of Y. 4_15 


Such a decoding function can be computed by an automaton with output called a 
decoding automaton. The construction goes as follows. Let A7,(X) = (P, (1, 1), (1, 1)) 
be the flower automaton of X. We define for each edge e of this automaton an output 
label o(e) € BU1. Let e = ((u,v), a, (u’,v’)) be an edge of A (X). Then we set 


ole) = { 6 (ua) if (u’,v’) = (1,1), 


1 otherwise. 


The output label o(c) of a path c is the product of the output labels of the edges that 


Figure 6.11: The decoding automaton. 4_16 


c is formed of. Then, for all x € X*, 


where c is the unique successful path with (input) label x. Thus y can be computed 
using the automaton A} (X). This is especially interesting when X is finite since, in 
this case, the corresponding flower automaton is also finite. 
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2.3 
EXAMPLE 1.4 Let X = {aa, ba, bb, baa, bba} be the code of Example EF Tet B be the 
coding morphism for X defined by 


B(c) = aa, B(d) = ba, Ble) = bb, B(f) = baa, B(g) = bba. 


alo(e) 


The output labels of the automaton A} (X) are represented in Fig. bu with p —> q 
denoting that the edge e = (p,a,q) has output label o(e). 


2 Unambiguous monoids of relations 


A relation m over P and Q is a subset of P x Q. If P = Q, we say that m is a relation 
over P. If (p,q) € m, we write equivalently 


(p, Em <= (p,m,q)=1 <> pmq = p =q eS Mpq=1. (6.2.1) 


Each of these notations refers to a specific view of a relation. The forth allows to 
consider a relation as a graph, the third mimicks order relations, last refers to a view 
of a relation as a matrix. Of course, one has the negations 


(pa) ém < (p,m,q) =0 > Mmpq=0. (6.2.2) 


In these writing, 0 and 1 refer to the elements of the Boolean semiring. In particular, 
viewed as matrices, relations are Boolean matrices. Since 0 and 1 are elements of every 
semiring, each matrix can also be viewed as a matrix with entries in this semiring. 
Similarly, a row or a column of a relation is a row or a column of the corresponding 
matrix. Thus Mp: = {q E Q | mpg = 1} and Myg = {p € Q | Myg = 1}. 

Each partial function from P to Q is a particular relation over P and Q. In particular, 
a permutation of Q is a relation over Q. 

The product of a relation m over P and Q and a relation n over Q and R is the 
relation mn defined by 


(p, r) Emn = Hq EQ:(p,q) € mand (q,r)En. 


The set P(Q x Q) of relations over a set Q is a monoid for this product. The product 
is unambiguous if for each (p,r), there exists at most one q € Q such that (p,q) E m 
and (q,r) € n. If the relations are viewed as graphs, this amounts to the uniqueness 
of paths of length 2, that is p > q = r, p “> q! > r imply q = q'. Viewed as 
matrices, the definition is equivalent to the property that the value of the product of 
m and n has the same value in any semiring. In particular, viewed as matrices with 
entries in N, the sums > qEQ ™p,qMq,r take only the values 0 or 1. Another way to view 
this is to observe that if r is a row of m, and 1 is a column of n, there is at most one 
q E Q such that rg = 1, = 1. 


EXAMPLE 2.1 Let m and n be the relations given in matrix form by 
0 1 0 0 0 1 
m=]1 0 0f, n={0 0 0 
1 1 0 1 1 0 
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One checks that the product over the integers gives 
0 0 0 
mn=}]0 0 1 
00 1 


and therefore the product of the relations is unambiguous. 


An unambiguous monoid of relations over Q is a submonoid M of the monoid of 
relations over Q such that for each m,n € M, the product mn is unambiguous. As a 
submonoid of P(Q x Q) it contains the identity idg. 


EXAMPLE 2.2 Every monoid of relation over a set Q which is composed of partial 
functions is unambiguous. 


EXAMPL 2,3, Ghe reader may check that the monoid generated by the matrices of 
Example p is unambiguous and has 9 elements. 


Recall that a monoid M of relations over Q is said to be transitive if for all p,q € Q, 
there exists m € M such that (p,q) E m. 

Let A = (Q, I, T) be an automaton over A. Reacll that, for each word w, we denote 
by pa(w) the relation over Q defined by 


(p,q) € palw) —> p >q 


It follows from the definition that p4 is a morphism from A* into the monoid of relations 
over Q. 

The next statement relates monoids of unambiguous relations and unambiguous au- 
tomata. 


PROPOSITION 2.1 Let A be an automaton over A. Then A is unambiguous if and only 
if the monoid p,4(A*) is unambiguous. Moreover, if A = (Q,1,1), then A is trim if 
and only if the monoid pa(A*) is transitive. 


Proof. Assume there are paths p > r —> q and p —> r! 4 qin A. If r Æ r’, the 
product of y,(u) and y,(v) is ambiguous, and conversely. 

Next let A = (Q,1,1) be a trim automaton. Let p,q E€ Q. Let u,v E€ A* be such that 
p —> 1, and 1 => q are paths. Then p +> q is a path and consequently ppa(uv)gq. 
The converse is clear. n 


A relation m over Q is invertible if there is a relation n over Q such that mn = nm = Ig 
where Jg is the identity relation over Q. 


PROPOSITION 2.2 A relation is invertible if and only if it is a permutation. 


Proof. Let m be an invertible relation, and let n be a relation such that mn = nm = Ig. 
For all p € Q, there exists q E€ Q such that pmq, since from pmnp we get pmqnp for 
some q € Q. This element q is unique: if pmq’, then gnpmq' = qIgq', whence q = g. 
This shows that m is a function. Now if pmq and p'mq, then pmqnp and p'mqnp, 
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implying p' = p. Thus m is injective. Since nm = Ig, m is also surjective. Thus m is 
a permutation. The converse is clear. m 


Let m be a relation over a set Q. A fixed point or fixpoint of m is an element q € Q 
such that qmq. In matrix form, the fixed points are the indices q such that Mq q = 1, 
thus such that there is a 1 on the diagonal. We denote by Fix(m) the set of fixed points 
of m. 


PROPOSITION 2.3 Let M be an unambiguous monoid of relations over Q. Let e € M 
and let S = Fix(e). The following conditions are equivalent: 
(i) e is idempotent. 
(ii) For all p,q € Q, we have p > q iff there exists an s € S such that p = s 
and s —> q. 
(iii) We have 
e=lr and rl = Is (6.2.3) 


where l C Qx S andr C Sx Q are the restrictions of e to Q x S and S x Q, 
respectively. 


If e is idempotent, then moreover, we have in matrix form 


with l C (Q — S) x S, r' C S x (Q — S) and r'Y =0. 


1 
The decomposition Eor an idempotent relation is called the column-row decom- 
position of the relation. Note that 


el=1, re=r, (6.2.4) 


since for instance re = rlr = rls =r. 

Proof. (i) = (ii). Let p,q € Q be such that peq. Then peîq. Consequently, there are 
s,t € Q such that peseteq. It follows that peseq and peteq. Since M is unambiguous, 
we have s = t, whence ses and s € S. The converse is clear. 

(ii) > (iii). Let l and r be the restrictions of e to Q x S and S x Q, respectively. If 
peq, then there exists s € S such that pes and seq. Then pls and srq. Conversely if 
pls and srq, then we have peseq, thus peq. Since this fixed point s is unique, we have 
e=lr. 

Now let r,s € S with rrls. Then rrqls for some q € Q. Thus req and qes. Moreover, 
rer and ses, whence 

rereqes, reqeses 


The unambiguity implies that r = q = s. Conversely we have srls for all s € S. Thus 
rl = ids. 

(iii) > (i). We have e? = Irlr = l(rl)r = lr = e. Thus e is idempotent. 

Assume now that e is idempotent. The restriction of e to Sx S is the identity. Indeed 
ses holds for all s € S, and if ser with s,r € S, then seser and serer, implying s = r 
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by unambiguity. This shows that l and r have the indicated form. Finally, the product 
rl is 
rl = Is +r 


Since rl = Ig, this implies that r'l’ = 0, which concludes the proof. 5 


Let M be an unambiguous monoid of relations over Q and let e € M be an idem- 
potent. Then eMe is a monoid, and e is the neutral element of eMe, since for all 
m € eMe, em = me = eme = m. It is the greatest monoid contained in and having 
neutral element e. It is called the monoid localized at e (cf. Sectign IP) e F-class 


i mel at 


H(e) of e is the group of units of the monoid eMe (Proposition 08.4). 


À 


PROPOSITION 2.4 Let M be an unambiguous monoid of relations over Q, let e be an 
idempotent in M and let S = Fix(e) be the set of fized points of e. The restriction 
y of the elements of eMe to S x S is an isomorphism of eMe onto an unambiguous 
monoid of relations Me over S. Ife = lr is the column-row decomposition of e, this 
isomorphism is given by 


y:m rml. (6.2.5) 


The set Ge = rH (e)l is a permutation group over S. 
Further, if M is transitive, then Me is transitive. 


2 
Proof. Let y be the function defined by Bii n m € eMe, then for s,t € S, 
(s, y(m), t) = (s, rml, t) = (s, mM, t), 


because we have srs and tlt. Thus y(m) is the restriction of the elements in eMe to 
SxS. Further, y is a morphism since 


y(e) = rel = ids 
and for m,n € eMe, 
y(mn) = y(men) = r(men)1 = rmirnl = y(m)7(n). 


Finally y is injective since if y(m) = y(n) for some m,n € eMe, then also ly(m)r = 
ly(n)r. But ly(m)r = Irmlr = eme = m. Thus m = n. The monoid 


Me = y(eMe) 


is a monoid of relations over S since it contains the relation ids. Its elements, as restric- 
tions of unambiguous relations, are themselves unaybiguous. Finally Ge = y(H(e)) is 
composed of invertible relations. By Proposition ae a permutation group over S. 

If M is transitive, consider s,t € S. There exists m € M such that smt. Then also 
semet. Taking the restriction to S, we have sy(eme)t. Since y(eme) € Me this shows 
that Me is transitive. : 
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EXAMPLE 2.4 Consider the relation given in matrix form by 


0100 
ford 
™=19 0 0 0 
lio 1 ol 
Then 
1010 
» oaio 
m =jooo0o0 
0100 


and m? = m. 
Thus m? is an idempotent relation. The monoid M = {1, m, m?} is an unambiguous 
monoid ofrelations. 


The fixed points of the relation e = m? 


are 1 and 2, and its column-row decomposition 
is 


0 
1 
0 
1 


oo oO 


We have 


m=l1 nae r 
~ 11 0 
and the restriction of m to the set {1,2} is the transposition (12). The monoid M, is 
equal to the group Ge which is isomorphic to Z/2Z. ° 


Let M be an arbitrary monoid. We compare now the localized monoids of two 
idempotents of a D-class. Let e, e’ be two D-equivalent idempotents of M. Since, by 
definition, D = RL, there exists an element d € M such that eRdLe’. By definition of 
these relations, there exists a quadruple 


(a,a',b, b") (6.2.6) 
of elements of M such that 
ea = d, da’ = e, bd = e’, bve =d (6.2.7) 
a) 
(see Fig. ETA. The quadruple (ras a passing s sjem from e to e’. 
The following formulas are easily derived from ( {Note that most of these iden- 
tities appear in Section Hier 
eaa =e, bea=c', ea= be, (6.2.8) 
and 
be’ =e, b'ea =e, be= ea (6.2.9) 
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Eqpassing2 
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Figure 6.12: The passing system. Right mutiplication by a or a’ is represented by a 
horizontal arrow and left multiplication by b or b’ is represented by a vertical arrow. 4_17 


(the last formula is obtained by be = bb’e’a’ = e/a’). Since e and e’ are idempotents, 


the following hold also: 
eabe = e, cable = e. (6.2.10) 


sing1 . 
Indeed, we have by a ,e = ele’ = beabea. Thus b'e'a' ai, y beabeaa’. Since 
be = e'a! by fan one has b/be = b'e'a’ = e and since by (ape eaa = e, we obtain 
b'e'a! = e = eabe. This proves the first equality. The second one is proved in the same 
way. 
Two monoids of relations M over Q and M’ over Q’ are equivalentthere exists a 
relation 6 € P(Q x Q’) which is a bijection from Q onto Q’ such that the function 


mr 6'mé 


is an isomorphism from M onto M’ (6° is the transposed of 0). Since 0 is a bijection, 


we have 6t = 07t . Therefore, in the case where M. and M gre permutation groups, 


this definition coincides with that given in Section QH. 


PROPOSITION 2.5 Let M be an unambiguous monoid of relations over Q, and let e,e' € 
M be two D-equivalent idempotents. Then the monoids eMe and e'Me' are isomorphic, 
the monoids Me and M. an equivalent, and the groups Ge and Ge are equivalent 
permutation groups. More precisely, let S = Fix(e), S’ = Fix(e’), let e = lr, e = l'r’ be 
their column-row decompositions, let y and y’ be the restrictions to S x S and S' x S” 
and let (a,a',b,b') be a passing system from e to e’. Then 

1. The function T : m => bma is an isomorphism from eMe onto e' Me’. 
2. The relation 0 = ral’ = rb'l € P(S x S’) is a bijection from S onto S'. 
3. The function T' : n> OnO is an isomorphism from Me onto Me. 

4. The following diagram is commutative 


z 
eMe — e'Me' 


a 


Me — Me 


ing1 
Proof, 1. etm € eMe. Then T(m) = bma = bemea = e'a'mb'e', since by Esg = 
and (ARa e'a' and b'e’ = ea. This shows that T(m) is in e’Me’. Next r(e) = 
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singi ing3 
bea = e’ by Esg for m,m’ € eMe, we have by ( 


T(m)r(m') = bmabm'a = bmeabem'a = bmem'a = bmm'a = T(mm’). 
Thus 7 is a morphism. Finally, it is easily seen that 
m = b'm'ad' 


is the inverse function of 7; thus 7 is an isomorphism from eMe onto e Me’. ibi 
2. We have eae’ = eb'e'. Consequently reae’l’ = reb’e'l’. Since by paer = 
r,€ Y =I’, we get that 
0 = ral = rb'Y 


The relation @ is left invertible since 

(r’b1)0 = r'blral = r'beal’ = r'e'l = idg 
and it is right-invertible, since we have 

O(r'a‘l) = rb'l'r'a’1 = rb'e'd'l = rel = ids 
Thus @ is invertible and consequently is a bijection, and 

0 = r'd'l = r'bl 
4. For m € eMe, we have 
t'y(m) = (rbl) (rml) (ral) = r'bemeal’ = r' (bma)! = y'T(m) 


showing that the diagram is commutative. 
3. Results from the commutativity of the diagram and from the fact that y, 7, y” are 
isomorphisms. m 


EXAMPLE 2.5 Consider the matrices 


000 1 001 0 
11010 11010 
S10, 6.06)? So o 
0100 0100 
They generate an unambiguous mongi ph relations (as we may verify by using, for 
instance, the method of Proposition j. The matrix 
0 1 0 0 
E 1 0 10 
“= 0 0 0 0 
1010 
3.1 
is the matrix m of Example Pat the element 
101 0 
0 1 0 0 
= 2 
SG: G06 
0 1 0 0 
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is an idempotent. Fix(e) = {1,2}, and the column-row decomposition is 
i 0 1 i an 


The matrix 


fan) 

| 

Pa 

z 

S 

xN 

N 

| 
| 

— 
ooo°o 
oF HRS 
SS 
——— 


is also an idempotent. We have Fix(e’) = {3,4}, and e’ has the column-row decompo- 
sition 


0 0 

,_ {1 of f1 01 0]_,,, 

e= allo o ol 
0 1 


The idempotents e and e’ lie in the same D-class. Indeed, we may take as a passing 
system from e to e’ the elements 


/ 
a=b =u, a = b = vw. 


The bijection 6 = ral’ from the set Fix(e) = {1,2} onto the set Fix(e’) = {3, 4} is 
0:1= 4,2—3 e 


We now describe a useful method for computing the permutation group Ge for an 
idempotent e in a unambiguous monoid of relations. This method requires us to make 
a choice between “left” and “right”. We first present the right-hand case. 

Let M be an unambiguous monoid of relations, and let e be an idempotent element 
in M. Let R be the R-class of e, let A be the set of H-classes of R and let G = H(e) 
be the H-class of e. For each H € A, choose two elements ay,a‘, E M such that 


eay € H, ea gay = €; 


with the convention that 
ag = ag =e. 


(see Figure ETS). Such a set of pairs (ay, a'y) HEA is “He a, aystem of coordinates of 


R relatively to the idempotent e. Then, by Proposition 0 an = H and Ha; = G 
since the elements ay, a‘, realize by right multiplication two reciprocal bijections from 
G onto H. 

Let e = Ir be the column-row decomposition of e, and set 


ry = ray and ly = d'yl for H €A. (6.2.11) 


ibis 
Note that then the equality ry = reay follows from r = re, which is (Saat = 
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Each m € M defines a partial right action on the set A by setting, for all H € A 


Hm= g” if HmEA, 


0 otherwise. (6.2.12) 


Now we define a partial function from A x M into G by setting 


= rymMlym, if Hm € A, 
Hem= {4 am (6213) 


First, observe that H -m Æ Ø implies H xm € Ge. Indeed, set H’ = Hm. From 
eag € H we get eagm € H’, showing that 


eaymay € G. 
It follows that 


Hxm=rgmly = (reap)m(al) 
= r(eaymay)l E€ Ge. 


Mh. 
e H 
a. 


Figure 6.13: Two coordinates. The pair (aH, ay) satisfies eag € H and ea Hay =e. |4_18 


© m/H * m © m/H’ * n (4) 


Figure 6.14: Composition of outputs. The label of an edge from H to H -m is the pair 
(m, H x m), denoted m/H * m. 4_19 


Observe also that for all H € A, 


H-1=H and H*1l=e. (6.2.14) 


Before stating the result, let us show the following relation. For all m,n € M, 


(H «m)(H-m*n) = H * mn. (6.2.15) 


This formula shows that the functions (H,m) +> H -m and (H,m) +> H *m can be 
viewed as a transducer j the sense detailed in the Notes. 

To verify formula ES let H' = Hm, H” = Hmn (the cases where H -m = @ or 
H -mn = (0 are straightforward). See Fig. . We have 


Hxm H' xn) = rymlgyrgnlyr = rymd ilragnlyr 
H 
= ra HMa'g ea H'Na'gnl 


= r((eayma'y je)a g'naygnl. 
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-Ibis 
(We have used Ei Since eayma'g, € G, we have eayma'gye = eayma',,. Thus 
(H «m)(H' xn) = r((eagym)a pag )nayrl. 


Since eaym € H’ and the multiplication on the right by aʻyapr is the identity on H’, 
we get 
* m * n) = rea gmna yil = rymnlyr = H * mn. 
(H x m)(H' xn) 7 lye 


This proves formula (ata. As a consequence, we have the following result: 


PROPOSITION 2.6 Let M be an unambiguous monoid of relations generated by a set 
T. Lete be an idempotent of M, let R be its R-class, let A be the set of H-classes 
of R and let (ay,a',)Hea be a system of coordinates of R relatively to e. Then the 
permutation group Ge is generated by the elements of the form Hxt, for H € A, teET, 
and H xt £ Ú. 


Proof. The elements H xt, for H € A and t € T either are 0 or are in Ge. 
Now let g be an element of H (e). Then 


g = titz: tn, ti ET, 
because T generates M. Let G = H (e) and let 
H; = Gtit2--- ti, Larsi. 


fe Gg = G it follows that Hjtj41---t, = G. Thus H; € A and G-t,---t; = H;. By 


? 


G x g =(G*t,)(My * t2) --- (Hn-1 * tn) 
But G x g = rgl. This shows the result. n 
The pair of partial functions 
AxM-—A, AxM—G. 


6 7 
defined by (Zai and (2TH is called the R-representation of M relatively to e and 


to the coordinate system (ap, a'g) HEA. 
Let 0 be a new element such that 0g = g0 = 00 = 0 for all g € Ge. The function 


u: M ee (Ge U |) cae 
which associates to each m € M the A x A-matrix defined by 


Gite _f Hxm if Hm = H’, 
pee =) o otherwise, 


is a morphism from M into the monoid of row-monomial A x A- atrjcgs with elements 
in Ge U0. This is indeed an equivalent formulation of formula (ap 

Symmetrically, we define the £L-representation of M relatively to e as follows. Let L 
be the £-class of e, and let I be the set of its H-classes. For each H € I’, choose two 
elements by, by E€ M such that 


bye € H, bybge =e, 
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with bg = bg = e. Such a set pairs (bi, b+) Her is called a system of coordinates of 
L with respect to e. As in ( , we set 1! = byc, r” = rb), for H ET. 
For each m € M, we define a partial left action on I by setting, for H €T, 


mH if mH ET, 
aa { 0 otherwise (6.2.16) 
and a partial function from M xT into Ge by setting 
rH ml”, ifm-mH ET, 
i { 0 otherwise 02) 
8 
Then formula (tay becomes 
(n*xm-H)(mx*x H) = nm» H (6.2.18) 


and Proposition Pios mutatis mutandis. 

Note that for the computation of the £-classes and the R-classes of an unambiguous 
monoid of relations, we can use the following observation, whose verification is straight- 
forward: If m£n (resp., if mRn), then each row (resp., column) of m is a sum of rows 
(resp., columns) of n and vice versa. This yields an easy test to conclude that two 
elements are in distinct £-classes (resp., 7-classes). 


3.2 
EXAMPLE 2.6 Let us consider again the unambiguous monoid of Example Est gener- 
ated by the matrices 


0 0 0 1 00 10 
|1010 |1010 
““loo0o00]’ ”7j1ı000 

0100 0 1 0 0 

We consider the idempotent 
1 01 0 
0 1 0 0 
— 2 
ee Ny OG 
0 1 0 0 


Its R-class R is formed of three H-classes, numbered 0,1,2. In Fig. TE a represen- 
tative is given for each of these H-classes. The fact that the £-classes 


e eu eu2 
0 1 2 
er, 
U,V 


Figure 6.15: The R-class of idempotent e. 
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are distinct is verified by inspecting the rows of e, eu, eu”. Next, we compute that eu? = 


eu?v = e, showing that these elements are R-equivalent. Further, euv = (uv)?He. 


Finally 


1010 
Bee] 
oF | oO OO 

ETS 


has only one nonnull row (column) and consequently cannot be in the D-class of e. We 
have reported in Fig. the effect of the right multiplication by u and v. 
We choose a system of coordinates of R by setting 


ao = Q =e, 
ay = u, a) = vw 
a = w, ay = u 
Then 
1 0 
ro=| ¢ eal foal 
0 0100 0 0 0 
[ot 
0 0 
~ [9001 ralada 
1 1010 í 0 1 
1 0 
0 1 
ou eee Oo wie 
? 0001 2 0 0 
o a 


Let us denote by H EER H' the fact that H-t = H’ and H *t = g. Then the R- 


representation o relatively to e and to this system of coordinates is obtained by 
completing Fig. and is given in Fig. . We set 


[bo] [23] 


The group Ge is of course Gg. 


The concepts introduced in this paragraph are greatly simplified when we consider 
the case of a monoid of (total) functions from Q into itself, instead of a monoid of 
unambiguous relations. 

For a € M, write pa = q instead of (p,a,q) = 1. 

The image of a, denoted Im(a), is the set of q E€ Q such that pa = q for some p E€ Q. 

The nuclear equivalence of a, denoted Ker(a), is the equivalence relation on Q defined 
by p = q mod Ker(a) iff pa = qa. 
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Figure 6.16: The -representation of M. 


If b € Ma, then Im(b) C Im(a). If b € aM, then Ker(a) C Ker(b) (note the inversion 
of inclusions). 

A function e € M is idempotent iff its restriction to its image is the identity. Thus, 
its image is in this case equal to its set of fixed points: Im(e) = Fix(e). 

As a result of what precedes, if a£b, then Im(a) = Im(b) and if aRb, then Ker(a) = 
Ker(b). This gives a sufficient condition to ensure that two elements are in different 
L-classes (resp. R-classes). 

To compute the R-class of an idempotent function e over a finite set, we may use the 
following observation, where S = Fix(e). If the restriction of m to S is a permutation 
on S, then eHem. Indeed, the restriction of m to S is a permutation on S, thus em? = e 
for some p, therefore emm?—! = e and thus emHe. 


EXAMPLE 2.7 Let M be the monoid of functions from the set 


Q = {1,2,...,8} 


into itself generated by the two functions u and v given in the following array 


1 2 3 4 5 6 7 8 
u 4 5 4 5 8 1 8 1 
v 2 3 4 5 6 7 8 1 


where each column contains the images by u and v of the element of Q placed on the 

top of the column. The function e = uft is idempotent and has the set of fixed points 
S = {1,4,5,8}, 

1 2 3 

u 1 4 #1 


4 5 6 7 
4 5 5 


8 
8 8 
We get the pattern of Fig. oTi for the R-class R of e. These four H-classes are distinct 
because the images of e, ev, ev”, ev are distinct. For the edges going back to the H- 
class of e, we use the observation stated above; it suffices to verify that the restrictions 


to S of the functions u, vu, v2u, võu, v are permutations. 


Figure 6.17: The R-class of e. 
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Choose a system of coordinates of R by taking 


ao 
ai 
a2 
a3 


ap = €, 


V, 


v, 


a = a 
a, = vÊ 
az = v. 


289 


For the computation of the R-representation of M relatively to e, we proceed as follows: 
if H -m = H’, then the permutation H * m on S i not computed by computing the 
matrix product H xm = rgmly of formula 
the restriction to S of eayma'ye, by evaluating this function on S. Thus we avoid 


unnecessary, matrix computations when dealing with functions. 


Figure 


3.6 
According to Proposition Prot the group Ge is generated by the permutations 


It is the dihedral group D4 which is the group of all symmetries of the square 


It contains 8 elements. 


, but, observing that H x m is 


shows the 7-representation obtained. 


u/(1458) 


u/(15)(48) 


u,v/(15)(48) 


Figure 6.18: The R-representation. 


(1458), 


(15)(48), 
1—4 
|| 
8—5 


3 Rank and minimal ideal 


(14)(58). 


Let m be a relation between two sets P and Q. The unambiguous rank of m, or rank, 
for short is the minimum of the cardinalities of the sets R such that there exist two 
relations 1 € W(P x R) and r € B(R x Q) with 


m=lr, 


(6.3.1) 


and such that the product Ir is unambiguous. The raņk, is denoted by rank(m). It is a 
Ei 


nonnegative integer or +00. A pair (l, r) satisfying 


is a minimal decomposition if 


there exists no unambiguous factorization m = l'r’ with l € P(P x R’), r' e B(R’ x Q) 
and R' Ç R. If rank(m) is finite, this is the equivalent of saying that Card(R) is 


minimal. 
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EXAMPLE 3.1 The relation 


0 1 0 0 1 
m= |1 0 0] =]1 0 k : i 
1 1 0 1 1 
has rank at most 2 in view of the above decomposition. It has not rank 1 because it 
has two distinct nonzero columns. Thus, m has rank 2. e 


The following properties are used frquently. First, if the product nmn’ is unambiguous, 
then 
rank(nmn’) < rank(m). (6.3.2) 


Indeed, each decomposition (1,r) of m induces a decomposition (nl,rn’) of nmn’. If 


l l 
p = s — t 5 u = q and p & 5!’ — t > u! “> q, then s = s' and u = u’ by 


the unambiguity of the product nmn’. The unambiguity of the product Ir forces t = t’. 
Second 


rank(m) < min{Card(P), Card(Q)}. 
If (l, r) is a minimal decomposition of m, then 
rank(m) = rank(l) = rank(r) . 
Further 
rank(m) = 0 &m=0. 
If P’ c P, Q' CQ, and if m is the restriction of m to P’ x Q’, then 
rank(m’) < rank(m) . (6.3.3) 


We get from the first inequality that two J-equivalent elements of an unambiguous 
monoid of relations have the same rank. Thus, the rank is constant on a D-class. 
Consider two relations m € $B(P x S) and n € P(S x Q). The pair (m,n) is called 
trim if no column of m is null and no row of n is null. This is equivalent to say that for 
all s € S, there exists at least one pair (p,q) € P x Q such that p —", sand s —> q. 


PROPOSITION 3.1 Any minimal decomposition of a relation is trim. 


Proof. Le lr be a minimal decomposition of a relation m. Assume that l contains a 
column which is null. Then we can delete this column and the row of same index of r 
without changing the value of the product. But this implies that (l, r) is not a minimal 
decomposition. Thus no column of l is null, and symmetrically no row of r is null. 
Consequently (1,r) is trim. . 


PROPOSITION 3.2 For each set Q, rank(idg) = Card(Q). 


Proof. Let id = Ir be a minimal decomposition of idg, with 1 € P(Q x P) and 
r € B(P x Q). Let p € P. Since the pair (l,r) is trim, there exist q,q’ € Q such 
that q ht p => q. Since Ir = idg, one has q = q', and there is no q” 4 q such that 


p — q". Thus r defines a mapping from P into Q. This mapping is surjective since 
idg is surjective. This imples that Card(P) = Card(Q). n 
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PROPOSITION 3.3 A permutation on Q has rank Card(Q). 
.4.2bis 
Proof. Let m be pepmutation on Q and let n be its inverse. Then by Proposition Ba 


and Equation ; 
Card(Q) = rank(idg) = rank(mn) < rank(m) . 


Thus rank(m) = Card(Q). m 


EXAMPLE 3.2 The rank of a partial function m from P to Q is 
rank(m) = Card(Im(m)). 


Let m’ be the restriction of m to P x Im(m). Then m = m’r, where r is the restriction 
of idg to Im(m). This shows that rank(m) < Card(Im(m)). The partial function 


m’ contains a bijection n of a cross-section of m onto Im(m) obtained by cho sing or 
one element, ip (P. for each set m'(q), with q € Im(m). By Proposition an 
Be rank 


Equation (m) > rank(n) = Card(Im(m)). l 
2.6 
Thus the notion of rank that we defined in Section i it comades with the notion 
defined here. 


Let us observe that the rank of a relation m over a finite set Q has strong connections 
with the usual notion of rank as defined in linear algebra. Let K be a field containing N. 
The rank of a matrix over K (or K-rank) is, as is well known, the maximal number of ia 
rows (or columns) which are linearly independent over K. We can observe (Exercise e 
that this number may be defined in a manner analoguous to the definition of the rank 
of a relation. In particular, 

rankg(m) < rank(m). 


4.4.2 
It is easy to see (Exercise B that the inequality usually is strict. However, in the case 
of relations which are functions, the two notions coincide. 
The following proposition gives an easy method for computing the rank of an idem- 
potent relation. 


PROPOSITION 3.4 Lete be an idempotent element of an unambiguous monoid of rela- 
tions. Then 
rank(e) = Card(Fix(e)). 


Proof. Set S = Fix(e). The column-row 7 ee of e shows that rank(e) < 


Card(S). Moreover, in view of Proposition 2-3, the matrix, e ,c ntains the identity 
matrix Ig. Thus Card(S) = rank(Is) < rank(e) by Equation eee n 


The following statement gives a characterization of relations of finite rank. 


PROPOSITION 3.5 For any relation m, the following conditions are equivalent: 
(i) m has finite rank, 
(ii) the set of rows of m is finite, 
(iii) the set of columns of m is finite. 
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Proof. (i) = (ii). Let m = Ir, with 1 € $(P x S) and r € P(S x Q) be a minimal 
decomposition of m. If two rows of l, say with indices p and q, are equal, then the 
corresponding rows Mps and Mgs of m also are equal. Since S is finite, the matrix l has 
at most 2Card(S) distinct rows. Thus the set of rows of m is finite. 

(ii) > (i). Let (m54)reg be a set of representatives of the rows of m. Then m = Ir, 
where r is the restriction of m to S x Q, and 1 € P(Q x S) is defined by 


L.- 1 if Mgs = Msz, 
o o0 otherwise. 
This shows (i) (ii). The proof of (i) (iii) is identical. m 


PROPOSITION 3.6 Letm be a relation over a set Q of finite rank. Then the semigroup 
generated by m is finite. 


Proof. Let m = Ir be a minimal decomposition of m, with 1 € P(Q x R) and r € 
(Rx Q). Let u be the relation over R defined by u = rl. Then for all n > 0, 


mt = I(rl)”r = lu"r. 


Since R is finite, the set of relations u” is finite and the semigroup {m” | n > 1} is 
finite. n 


In particular it follows from this proposition that for any relation of finite rank, a 
convenient power is an idempotent relation. 

Let M be an unambiguous monoid of relations over Q. The minimal rank of M is the 
minimum of the ranks of the elements of M other than the null relation. It is denoted 
by r(M), 

r(M) = min{rank(m) | m € M \ {0}. 


If M does not contain the null relation over Q, this is of course the minimum of the 
ranks of the elements of M. One has r(M) > 0 if Q £0 and r(M) < œ if and only if 
M contains a relation of finite positive rank. 

We now study the monoids having finite minimal rank and we shall see that they 
have a rather precise structure. We must distinguish two cases: the case where the 
monoid contains the null relation, and the easier case where it does not. 

Note that the null relation plays the role of a zero in view of the following, more 
precise statement: If a transitive unambiguous monoid of relations over a nonempty 
set Q contains a zero, then the zero the null relation. 

Indeed, the null relation always is a zero. Conversely, if M has a zero z, let us prove 
that z is the null relation. If Card(Q) = 1, then z = 0. Thus we assume Card(Q) > 2, 
and z #0. Let p,q E€ Q such that 2p, = 1. Let r,s € Q. By transitivity of M, there 
exist m,n E€ M such that 

Mrp = Nqs = 1. 
From mzn = z, it follows that zrs = 1. Thus zrs = 1 for all r,s € Q, which contradicts 
the unambiguity of M. 
Let M be an unambiguous monoid of relations over Q. For each q € Q, let stabilizer 


of q is the submonoid 
Stab(q) ={meM|q—>q}. 
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THEOREM 3.7 Let M be a transitive unambiguous monoid of relations over Q, con- 
taining the relation 0, and having finite minimal rank. Let K be the set of elements of 
M of minimal rank r(M). 

1. M contains a unique 0-minimal ideal J, which is K U {0}. 

2. The set K is a regular D-class whose H-classes are finite. 

3. Eachq E Q is a fixed point of at least one idempotent e in K i.e., e € KNStab(q). 
. For each idempotent e € K, the group Ge is a transitive group of degree r( M). 
. The groups Ge, for e idempotent in K, are equivalent. 


ons 


Before we proceed to the proof, we establish several preliminary results. 


PROPOSITION 3.8 Let M be an unambiguous monoid of relations over Q, and lete € M 
be an idempotent. If e has finite rank, then the localized monoid eMe is finite. 


4.3 
Proof. Let S be the set of fixed points of e. By Proposition Bea the set S is finite. Thus 
the monoid Me w ig is an unambiguous monoid of relations over S, is finite. Since, 
by Proposition B.o} the monoid eMe is isomorphic to Me, it is finite. n 


We now verify a technical lemma which is useful to “avoid” the null relation. 


LEMMA 3.9 Let M be a transitive unambiguous monoid of relations over Q. 


1. For allm € M \O, there existn € M and q E€ Q such that 
mn € Stab(q) (resp. nm E€ Stab(q)). 


Thus in particular mn 4 0 (resp., nm 4 0). 
2. For allm € M \ 0 andq € Q, there exist n,n’ € M such that 


nmn’ € Stab(q) . 
3. For all m,n € M \ 0, there exists u E€ M such that 
mun #0. 


In other terms, the monoid M is prime. 


Proof. 1. Let q,r € Q be such that (q,m,r) = 1. Since M is transitive, there exists 
n E€ M such that (r,n,q) = 1. Thus (q,mn,q) = 1. 

2. There exist p,r € Q such that (p,m,r) = 1. Let n,n’ € M be such that (q,n,p) = 
1, (r, n',q) =1. Then (q,nmn’,q) = 1. 

3. There exist p,r,s,q E Q such that (p,m,r) = (s,n,q) = 1. Take u € M with 
(r,u,s) =1. Then (p,mun,q) =1. n 


PROPOSITION 3.10 Let M be a transitive unambiguous monoid of relations over Q, 
having finite minimal rank. Each right ideal R #0 (resp., each left ideal L # 0) of M 
contains a nonnnull idempotent. 
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Proof. Let r €e R\ 0. By Lemma Brot there exist n € M and q E Q such that 

rn € pee Let m € M be an element such that rank(m) = r(M). Again by 

Lemma ere exist u,v E€ M such that umv € Stab(q). Consider the element 

m' =rnumv. Then m’ € R and m’ € Stab(q). De 
Since rank(m’) < rank(m), the rank of m’ is finite. According to Proposition Bo 

the semigroup generated by m’ is finite. Thus there exists k > 1 such that e = (m’)* 

is idempotent. Then e € R and e £0 since e € Stab(q). n 


PROPOSITION 3.11 Let M be a transitive unambiguous monoid of relations over Q, 
having finite minimal rank and containing the null relation. For allm € M, the 
following conditions are equivalent: 

(i) rank(m) = r(M), 
(ii) the right ideal mM is 0-minimal, 
(iii) the left ideal Mm is 0-minimal. 


Proof. (i) = (ii) Let R F { 9 right ideal contained in mM. We show that R = mM. 
According to Proposition , R contains an idempotent e # 0. Since e € RC mM, 
there exist n € M such that e = mn. Since rank(e) < rank(m) and rank(m) is 
minimal, we have rank(e) = rank(m). Let m = lr be a minimal decomposition of m, 
with le P(Q x S), r e P(S x Q). Then e = (Ir)n = l(rn). The product l(rn) is easily 
checked to be unambiguous. Since rank(e) = r(M) = Card(S), the pair (l, rn) is a 
minimal decomposition of e. For all k > 0, 


e = ett! = I(rnl)*rn 
with all products unambiguous. Since S is finite, there exists an integer 7 > 1 such that 
(rnl)’ is an idempotent element of the unambiguous monoid of relations on S composed 
of the powers of rnl. Since rank((rnl)') = Card( S), each element in S is a fixed point 
of (rnl)’. Consequently (rnl)' = ids. Thus 


em = em = (Irn)'m = (Irn)'Ir = l(rnl)'r = lr = m 


The equality em = m shows that m € R, whence R= mM. Thus mM is a 0-minimal 
right ideal. ae 

(ii) = (i) Let n € M be such that rank(n) = r(M). By Lemma Bot there exists 
u E€ M such that mun ~ 0. From munM C mM, we get munM = mM, whence 
m € munM. Thus rank(m) < rank(n), showing that rank(m) = rank(n). 

(i) = (iii) is shown in the same way. n 


-4.5 
Proof of Theore n 
1. By LemmaB-9 the monoid M is prime. According to Proposition ZLM 


M contains 0- a nina left and right ideals. In view of Corollary O18. 10, 
contains a unique 0-minimal ideal J which is pa z nign of the 0-minimal right ideals 


(resp., left ideals). Once more by Proposition is the union of 0 and of the set 


K of elements of minimal, positi vO gank. This proves claim 1. 
2. In view of Corollary DERE E so, set is as a,regular D-class. All the H-classes of K 
have same cardinality by Proposition e finiteness of these classes will result 


from claim 4. 
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Since the semigroup generated by m = nkn’ is finite (Proposition , It contains an 
idempotent e. Then e € K N Stab(q). 

4. Let e be idempotent in K. Then the H-class of e is H U 0 = eM N Me = eMe = 
H(e) U0. The first equality is a result of the fact that the R-class of e is eM — 0. 
Next eMe C eM N Me, and conversely, if n € eM Me, then en = ne = n whence 
n = ene € eMe. This shows t e şeçond equality. Finally, H (e) = H since H is a group. 

According to Proposition 2.4, we have Me = Ge U0 and Me is transitive. Thus Ge is 


a transitive permutation group. Its degree È xM 2. 


5. Is a direct consequence of Proposition 


-4.8 
3. Let q E€ Q and k € K. By Lemma EA € Stab(q) i: Deg n,n’ € M. 


Now let M be a nambiguous monoid of relations that does not contain the null 
relation. Theorem admits a formulation which is completely analogous, and which 
goes as follows. 


THEOREM 3.12 Let M be a transitive unambiguous monoid of relations over Q which 
does not contain the null relation and which has finite minimal rank. Let K be the set 
of elements of minimal rank r(M). 
1. The set K is the minimal ideal of M. 
2. The set K is a regular D-class and is a union of finite groups. 
3. Each q € Q is the fixed point of at least one idempotent e in K i.e. e € 
K A Stab(q). 
4. For each idempotent e € K, the group Ge is a transitive group of degree r(M), 
and these groups are equivalent. 


Proof. Let Mo be the unambiguous monoid of relations 
Mo = M VO. 


We have r(M) = r(Mo). Thus Theorem apne to Mo. For all m in M, we have 
mMəo = mM U0. It follows easily that mM is a minimal right ideal of M iff mMo is a 
0-minimal right ideal of Mo. The same holds for left ideals and for two-sided ideals. In 
particular, the 0-minimal ideal J of Mo is the union of 0 and of the minimal ideal K 
of M. This proves 1. Next K is a D-class of Mọ thus also of M. Since the product of 
two elements of M is never 0, each H-class of K is a group. This proves 2. The other 
claims require no proof. m 


Let M be a transitive unambiguous monoid of relations over Q, of finite minimal 
rank, and let 
K = {m € M | rank(m) = r(M)} 


The groups Ge, for each idempotent e in K, are equivalent transitive permutation 
groups. The Suschkewitch group of M is, by definition, any one of them. 


4 Very thin codes 


A code X C A? is called very thin if there exists a word x in X* which is not a 
factor of a word in X. Recall that F(X) is the set of factors of words in X, and that 
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F(X) = A* — F(X). With these notations, X is very thin iff 
X*N F(X) #4. 


Any very thin code is thin (ie., s tisfjes F(X) # 0). Conversely, a thin code is not 
always very thin (see Example Rei However, a thin complete code X is very thin. 
Consider indeed a word w € F(X). Since X is complete, there exist u,v € A* such 
that uwv € X*. Then uwv € X* N F(X). 

The aim of this section is to prove the following result. It shows, in particyl ‘ybbat 
a recognizable code is very thin. This is more precise than Proposition fete which 
only asserts that a recognizable code is thin. 


THEOREM 4.1 Let X C A* be a code and let A = (Q,1,1) be an unambiguous trim 
automaton recognizing X*. The following conditions are equivalent. 

(i) X is very thin. 

(ii) The monoid vy 4(A*) has finite minimal rank. 


The proof of this result is in several steps. We start with the following property used 
to prove the implication (i) = (ii). 


PROPOSITION 4.2 Let X C At be a code and let A = (Q,1,1) be an unambiguous trim 
automaton recognizing X*. For all w € F(X), the rank of p(w) is finite. 


Proof. Let us write y instead of y4. For each p E€ Q, let ®(p) be the set of prefixes of 
w which are labels of paths from p to 1: 


(p) = {u € A* | u < w and py(u)1} 


We now show that if ®(p) = ®(p’) for some p, p' € Q, then the rows of index p and p’ 
in y(w) are equal. Consider a q € Q such that 


pylw)q. 


Since the automaton is trim, there exist v, v’ € A* such that 1ly(v)p and qy(v’)1. Thus 
ly(vwv’)1 and consequently vw’ € X*. Since w € F(X), the path p “+ q is not 
simple; therefore there exist u,u’ € A* such that w = uu’ and vu,u'v’ € X*. Thus 
there is, in A, the path 


f / 


1 >p a E 


By definition, u € ®(p). Thus u € ®(p’). It follows that p’p(u)ly(u’)g, and conse- 
quently p'y(w)q. This proves the claim. 

The number of sets ®(p), for p € Q, is finite. Acco ding to the claim just proved, the 
set of rows of y(w) also is finite. By Proposition Betts implies that y(w) has finite 
rank. n 


EXAMPLE 4.1 Let X be the code X = {a"ba" | n > 0}. This is a very thin code since 


b? c X* A F(X). An automaton recognizing X* is given in Fig. . The image e of 
b? in the associated monoid of relation is idempotent of rank 1. The finiteness of 
the rank also follows from Proposition ‘since b? is not factor of a word in X. The 
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localized monoid eMe is reduced to e and 0 (which is the image of b?ab, for example). 
The monoid M has elements of infinite rank: this holds for the image of q. Indged, 
clearly no power of this element can be idempotent; hence by Proposition Te 
infinite rank. Moreover, M has elements of finite rank n for each integer n > 0: the 
image of ba”ba”b has rank n + 1, as the reader may verify. 


Figure 6.19: An automaton for X* 


PROPOSITION 4.3 Let X be a code over A, let A = (Q,1,1) be an unambiguous trim 


automaton recognizing X*, let p be the associated representation and M = y(A*). 
For each idempotent e in y(X*) with finite rank such that the group G is transitive, 
the following assertions hold. 
1. There exist v1,V2,--.,Un+1 € Y '(H(e)) with the following property: for all 
y,z E€ A* such that 
YVI? +++ Un41z E X” 


there is an integer i, (1 < i < n) such that: 
YVI? -+ Vi, Vi+1 tt Ungiz E X“. 
2. The set gt (e)N F(X) is nonempty. 


Proof. Let e = Ir be the column-row deco sition of e, let S be the set of its fixed 
points and let G = H(e). By Proposition Byte restriction y : eMe — Me is the 
isomorphism m — rml, and its inverse is the function n —> Inr. 

The set S contains the element 1, since e € y(X*). Set S = {1,2,...,n}. We first rule 
out the case where y~!(e) = {1} . Then e is the neutral element of M. Thus S = Q. 
Since H(e) = {1} and G — e is assumed to be transitive, this forces A = X. Thus the 
result holds trivially. 

We now assume that y~!(e) 4 {1} . Choose elements go, 93,..-,9n E€ Ge such that 


292 = 1, 39293 = 1, ..., 2g293°°+Gn = 1. 


These elements exist because Ge is a transitive permutation group. The permutations 
92,93,- --, gn are the restrictions to S of elements h2,h3,...,hn of H(e) and one has 
hi = lgir. Thus g; = rhjl = y(h;). Let v1, v2,...,Un41 E At be such that 


(v1) = p(Un41) =e, (v2) Shay aes (Un) = hn. 


Set w = v1v2+++Un41- Consider words y, z E€ A* such that ywz € X*. Then there exist 
p,q E€ Q such that 


1 Yen Zq Žž, 1. 
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Note that 


Since py(w)q, there exist r,s € S such that p A, r, T92- gn = 8, and s —> q. Then 
rg2':: gr = 1 (with g2---gr = idg when r = 1). Since the g;’s are permutations, this 
implies 

lgr+1't: 9n = 8, 


hr4ihn 


ho-wh : 
Consequently r %1,1 > s, and since lpr = €pr, V's,q = €s,q, We have 


eh2--hr hr417hne 
oo =- 


o 1 
This implies that 
YVIVZ ` Ur, Ur41 + Un41z E X*. 


Thus the words v1,--- ,Un+1 satisfy the first statement. 

To show the second part, we verify first that the word w = v1, v2-++Un41 is in F(X). 
Assume indeed that ywz € X for some y,z € A*. Then there exists an integer i 
(1 <i < n) such that yvi- -Ui Vipi: Ungiz € X*. Since v1,...,Un41 E€ AF, these 
two words are in fact in X*, contradicting the fact that X is a code. Thus w € F(X). 

Let h’ be the inverse of h = y(w) in H(e), and let w’ be such that y(w’) = h’. Then 
ww’ € g7} (e), and also ww’ € F(X). This concludes the proof. n 


oped e .5.2 
Proof of Theorem Fr i (1) = (ii) Let x € X*M F(X). According to Proposition fea the 


rank of y(x) is finite. Since x € X*, we have (1,y,4(X),1) = 1 and thus y4(x) 4 0. 
This shows that y,4(A*) has finite minimal rank. 

(ii) > (i). The monoid M = y4(A*) is a transitive unambiguous monoid of relations 
having finite minimal rank r(M). Let 


K = {m € M | rank(m) = r(M)}. 


4.5 4.11 
By Theorems B and BTA Tiere exists an idempotent e in K tak(1), and the 


permutation group Ge is transitive of degree r(M). By Proposition 3, the set yy (e)n 
F(X) is not empty. Since yy (e) C X*, the code X is very thin. m 


.5.1 
We now examine a series of consequences of Theorem fr it 


COROLLARY 4.4 Let X be a complete code, and let A = (Q,1,1) be an unambiguous 
trim automaton recognizing X*. The following conditions are equivalent. 

(i) X is thin. 

(ii) The monoid p,(A*) contains elements of finite rank. 


Proof. Since X iş gomplete, the monoid y,4(A*) does not contain the pull relation 
(Proposition ~ Thus the result follows directly from Theorem EI ~ 7 


5.1 
Another conse ance of Theorem E i is an algebraic proof, independent of measures, 
of Theorem 6.9. ~ 


COROLLARY 4.5 If X is a thin complete code, then X is a maximal code. 
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Proof. Let A = (Q,1,1) be an unambiguous trim automaton recognizing X* and let y 
be the associated representation. Let x € X* such that e = y(x) is an idempotent of ga 
the minimal ideal J of the monoid y(A*). (Such an idempotent exists by Theorem 
claim 3). 

Let y g X. Then ey(y)e = p(xyx) is in the H-class of e. This H-class is a finite 
group. Thus there exists an integer n > 1 such that (y(xyx))” = e. Consequently 
(xyx)” € X*. This shows that X U y is not a code. n 


Let X C At be a code and let A = (Q,1,1) be an unambiguous trim automaton 
recognizing X*. We have shown that X is very thin iff the monoid M = y4(A*) has 
elements of finite, positive rank. Let r be the minimum of these nonzero ranks, and let 
K be the set of elements in M of rank r. Set p = yy. It is useful to keep in mind the 
following facts. tee 

gl Xx *) meets K. Indeed y(X*) = Stab(1) and according to Theorems and 
meets Stab(1). 

2. Every H-class H contained in K that meets y(X*) is a group. Moreover, y(X*)NH 
is a subgroup of H. These #-classes are those which contain an idempotent having 1 
as a fixed point. 

Indeed, let H be an H-class meeting y(X*). Let h € HN y(X*). Then h? is not 
the null relation ‘ince h? € Stab(1). Thus h? € H and consequently H is a group 
(Proposition eee Let N = HN y(X*). Since y(X*) is stable submonoid of M, N 
is a stable submonoid of H, hence a subgroup (Example Te 

Figure represents, with slashed triangles, the intersection KNy(X*). It expresse LG 
that the H-classes of K meeting y(X*) “form a rectangle” in K (see Exercise paeta 
Collecting together these facts, we have proved. 


THEOREM 4.6 Let X C A% be a very thin code. Let A = (Q,1,1) be an unambiguous 
trim automaton recognizing X*. Let K be the set of elements of minimal nonzero rank 
in the monoid M = py(A*). 

1. pa(X*) meets K. 

2. Any H-class H in K that meets p,(X*) is a group. Moreover, H N p4(X*) 
is a subgroup of H. 

3. The H-classes of K meeting pa(X*) are those whose idempotent has the state 
1 as a fixed point. 


ALAA 


Figure 6.20: The minimal ideal 4_25 


Another consequence of the results of this ‘eget ah! the following lemma 
which was stated without proof in Chapter i ( bent 
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LEMMA 4.7 Let X be a complete thin code. For any word u E€ X* there exists a 
word w E€ X*uX* satisfying the following property: if ywz € X*, then there exists a 
factorization w = fug such that yf,gz € X*. 


Proof. Let p be the representation associated with some unambiguous trim automaton 
recognizing X*. Since X is thin, the monoid M = y(A*) has a minimal ideal J. Since 
X is complete, M has no zero and thus y(XT) gets J . Let e be an idempotent, i ie 
p(X*)nJ. The group Ge is transitive by Theorem B.7Jand, according to Proposition e 
there exist words v1, v2,...,Un41 € Y71(H (e)) such that the word v = viv: Un+1 has 
the following property: if yvz € X* for some y,z € A*, then there exists an integer i 
such that yv1 -< Uj, Viti ++ Un412 E X*. 

We have ey(u)e E€ eMe = H(e), and ey(uje € y(X*). Since H(e) N y(X*) is a 
subgroup of H (e), there exists h € H (e) N y(X*) such that ey(u)eh = e. Since h = eh, 
we have ey(u)h = e. Consider words r € y7} (e), s € py !(h), set u’ = rus and consider 
the word 

wav vyu'vg::-u'Un iu. 


Let y, z E€ A* be words such that ywz € X*. Since y(u’) = e, we have 


Consequently also yvz is in X*; thus for some integer i, 
YVI ++ Vi, Vi41 t Un41z E X*. 


Observe that 
_ ! ! 1 

p(viva ++: Uj) = plu viu va: uvi) 

and 
/ / / 
P(Vit1 + Unyi) = PVU +++ UW Unga ). 
Thus also yu’vyu'vg:--u'v; and Viiu’ +++ Un4u'z are in X*. 
Let 


/ / / / / 
f = u viu vg: w vir, J = SUj41U ` Unpi U. 


Since r,s € X*, we have yf,gz € X* and this shows that the word w = fug satisfies 
the property of the statement. n 


Finally, we note that for complete thin codes, some of the information concerning 
the minimal ideal are characteristic of prefix, suffix, or bifix codes. 


PROPOSITION 4.8 Let X be a thin complete code over A, let p be the representation 
associated with an unambiguous trim automaton A = (Q,1,1) recognizing X*, let M = 
p(A*) and J its minimal ideal. Let Ho, Ro, Lo be an H, R, L-class of J such that 
Ho = Ro N Lo and y(X*)N Ho # 0. 
1. X is prefix if and only if p(X*) meets every H-class in Lo. 
2. X is suffix if and only if y(X*) meets every H-class in Ro 
3. X is bifix if and only if p(X*) meets all H-classes in J. 
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Proof. (1) Let H be an H-class in Lo, let e9 be the idempotent of Ho and let e be the 
idempotent of H (each H-class in J is a group). We have eoe = eo since e € Lo (for 
some m, we have me = eo; consequently eg = me = mee = ee). 

If X is prefix, then y(X*) is right unitary. Since eg E€ y(X*) and ep = eoe, it follows 
that e € y(X*). Thus HN y(X*) £90. 

Conversely, let us show that y(X%*) is right complete. Let m € M. Then meo € Lo, 
thus meg € H for some H-class H C Lg. If n is the inverse of mep in the group H, 
then meon € y(X*). Thus y(X*) is right complete and X is prefix. 

The proof of (2) is symmetric, and (3) results from the preceding arguments. n 


Proposti can be generalized to codes which are not maximal (see Exercise ES 

Let X C A* be a thin, maximal prefix code, and let A = (Q,1,1) be a complete, 
deterministic automaton recognizing X*. The monoid M = p4(A*) then iş a monoid 
of (total) functions and we use the notation already introduced in ho pretenia 
write, for m € M, qm = q' instead of (q, m, q’) = 1. Let m € M, and w € A* with 
m = (w). The image of m is 


m(m) = Qm=Q- w, 
and the nuclear equivalence of m, denoted by Ker(m), is defined by 
q= g (Ker(m)) & qm = qm. 


The number of classes of the equivalence relation Ker(m) is equal to Card(Im(m)); 
both are equal to rank(m), in view of Example ee 

A maximal nuclear equivalence is an equivalence which is maximal among the nuclear 
equivalences of elements in M. It is an equivalence relation with a number of classes 
equal to r(M). A minimal image is similarly an image of cardinality r(M), i.e., an 
image which does not strictly contain any other image. 


PROPOSITION 4.9 Let X C A* be a thin maximal prefix code, let A = (Q,1,1) be a 
deterministic, complete automaton recognizing X*, let M = p,(A*) and let K be the 
D-class of the elements of M of rank r(M). Then 

1. there is a bijection between the minimal images and the L-classes of K 


2. there is a bijection between the maximal nuclear equivalences and the R-classes 
of K. 


Proof. 

1. Let n,m € M be two L-equivalent elements. We prove that Im(m) = Im(n). 
There exist u,v E€ M such that m = un,n = vm. Thus Qm = Qun C Qn, and also 
Qn C Qm. This shows that Im(m) = Im(n). 

Conversel let m,n € K be such that Im(m) = Im(n). K being a regular D-class 
(Theorem , the £-class of m contains an idempotent, say e, and the £-class of 
n contains an idempotent f (Proposition pe on Im(e) = Im(m) and Im(f) = 
Im(n), in view of the first part. Thus Im(e) = Im(f). We shall see that ef = e and 
fe=f: 

Let indeed q € Q, and q' = qe. Then qd’ € Im(e) = Im(f). Thus qd’ = df since f is 
idempotent. Consequently qe = qef. Thus e = ef. The equality fe = f is shown by 
interchanging e and f. These relations imply eL f. Thus m£n. 
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2. The proof is entirely analogous. n 


Note also that in the situation described above, every st tg appears in some minimal 
image. This is indeed the translation of Theorem . This description of the 
minimal ideal of a monoid of functions, by means of minimal images and maximal 
equivalences, appears to be particularly convenient. 


EXAMPLE 4.2 Let X = {aa, ba, baa}. We consider the automaton given in Fig. Ei 


Figure 6.21: An automaton for X* 


The 0-minimal ideal of the corresponding monoid is the following: it is formed of 
elements of rank 1. 


001 110 


with 


For each element we indicate, on the top, its unique nonnull row, and, on its left, its 
ay 
a2 


unique nonnull column (with the convention a,---a', = |). The existence of an 


an 
idempotent is indicated by an asterisk in the H-class. The column-row decomposition 
of an idempotent is simply given by the vectors in the rows and columns of the array. 
For example, the column-row decomposition of a is 


0 
aß = |1| [0 0 1] 
1 
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The following array gives the fixed point of each idempotent 


EXAMPLE 4.3 Let X = {aa, ba, baa, bb, bba}. We consider the automaton given in 
Fig. FA The corresponding monoid has no 0 (the code is complete). 


(2) 
© a,b 


Figure 6.22: An automaton for X* 
The minimal ideal, formed of elements of rank 1, is represented by 


001 110 


EXAMPLE 4.4 Let A = {a,ā,b,b}. Denote by 0 the congruence on A* generated by 
the relations g 
aa~ l, bb a~ 1. 


The class of 1 for the congruence @ is a biunitary submonoid. We denote by D4 the code 
generating this submonoid. The set D$“ can be considered to be the set of “systems 
of parentheses” with two types of brackets: a,b represent left brackets, and a,b the 
corresponding right brackets. 

The code D4 is thin since D4 is not complete. Indeed, for instance, ab ¢ F(D4) since 
ab ¢ F(D}*). However, D} is not very thin. Indeed, for all w € Di*, we have awa € D}. 
The code Dh, is bifiy. Let A(D5") = (Q,1,1), let p = p4 and let M = y(A*). By 
Proposition aime monoid M is isomorphic with the syntactic monoid of D4". We 
have D5* = y~!(1) since Dj" is the class of 1 for a congruence. 
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The monoid M contains a 0 and 
p+ (0) = F(D4’). 


The only two sided ideals of M are M and 0. Indeed, if m € M — 0 and w € y7! (m), 
then w € F(D$"). Therefore, there exist u,v € A* such that uwv € DS". Hence 
yp(u)my(v) = 1 whence 1 € MmM and MmM = M. 

This shows that M itself is a 0-minimal ideal. Nonetheless, do s. got contain 
any 0-minimal right ideal. Suppose the contrary. By Proposition pe M would be 
the union of all 0-minimal right ideals. Thus any element of M — 0 would generate a 
0-minimal right ideal. This is false as we shall see now. 

For all n > 1, y(a")M D vy(a"t!)M. This inclusion is strict, since if y(@”) = 
y(a"*tw) for some w € A*, then a”a” € D}* would imply a"a”"*!w € D}*, whence 
aw € D}* which is clearly impossible. 

This example illustrates the fact that for a code X which is not ve thin, no au- 
tomaton recognizing X* has elements of finite positive rank (Theorem f 


5 Group and degree of a code 


Let X C At be a very thin code, let A} (X) be t ¢. flower automaton of X and let yp 
be the associated representation. By Theorem 4. I} the monoid yp(A*) has elements of 
finite, positive rank. 

The group of the code X is, by de nition, , the Suschkewitch group of the monoid 
pp(A*) defined at the end of Section ki If is a transitive permutation group of finite 
degree. Its degree is equal to the minimal rank r(yp(A*)) of the monoid yp(A*). 

We denote by G(X) the group of X. Its degree is, by definition, the degree of the 
code X and is denoted by d(X). Thus one has 


d(X) = r(ep(A")) 


We already met a notion of degree in the case of thin maximal bifix codes. We shall 
see below that the present and previous notions of degree coincide. 

The definition of G(X) and d(X) rely on the flower automaton of X. In fact, these 
concepts are independent of the automaton which is considered. In order to show this, 
we first establish a result which is interesting in its own right. 


PROPOSITION 5.1 Let X C At be a thin code. Let A = (P,1,1) and B = (Q,1,1) 
be two unambiguous trim automata recognizing X*, and let y and w be the associated 
representations. Let M = y(A*), N = W(A*), © = y(F(X)), Y = V(F(X)), let E be 
the set of idempotents in ®, and E’ the set of idempotents in Y. 
Let p: P — Q be a reduction of A onto B and let pj: M — N be the surjective 
morphism associated with p. Then 
1. (E) =f". 
2. Lete € E, e = f(e). The restriction of p to Fix(e) is a bijection from Fix(e) 
onto Fix(e'), and the monoids Me and Ne are equivalent. 
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Proof. Since A and B recognize the same set, we have p~'(1) = 1 (Proposition By 
The morphism p : M — N defined by p satisfies Yy = fo ọ. 

1. Let e € E. et ple) = P(e”) = A(e)?. Thus A(e) is an idempotent. If e = y(w) 
for some w € ie , then p(e) = w(w), whence (e) € Y. This shows that (E) c F”. 

Conversely, | Pi and let w € F 3 ) with e' = (w). Then y(w) has finite rank 
by a £2 by Proposition ere is an integer n > 1 such that (y(w))” 
is an idempotent. Set e = a ae ))"; then e = y(w”) and w” € F(X). Thus e € E. 
Next (e) = y(w”) = il . This shows that p(F) = F”. 

2, Let S = Fix(e), S = ~ Fix(e ^). Consider s € S$ and let s’ = p(s). From ses, we 
get e's. pas gle) cS’. Conversely, if s’e’s’, then peq for some p,q € p~1(s’). Dy 
Proposition there oat s € S such that peseq. This implies that s’e’p(s)e’s 
and, by Bu n. p(s) = s’. Thus p(S) = S. 

Now let s,t € S be such ae pls) = p(t) = s'. If s = 1 then t = 1, since p +1) = 1. 
Thus we may assume that s,t 4 1. Since e € ©, there exist w € F(X) with e = y(w) 
and factorizations w = uv = u'v’ such that 


pluv) = p(u'v') =e 


and 
s — ]— s t—1l—t 
This implies that 
gS ia, gh lt eg 
whence in particular in B 
IAL 


Since p~'(1) = 1, this implies that there is also a path 1 bin A; this in turn implies 
that 


or, equivalently, (s,e,t) = 1. Since e is an idempotent and s,t € S, this implies that 
s =t. Thus the restriction of p to S is a bijection from S onto S”. 

Since (eMe) = e'Ne', the restriction of p to S defines an equivalence between Me 
and Nv. 7 


PROPOSITION 5.2 Let X be a very thin code over A. Let A = (Q,1,1) be an unambigu- 
ous trim automaton recognizing X*, and let p be the associated representation. Then 
the Suschkewitch group of p(A*) is equivalent to G(X). 


Proof. According to Proposition Pt there exists a reduction from A7,(X) onto A. Let 
e be a nonnull idempotent in the 0-minimal ideal of M = yp(A*). The image of e by 
the reduction is a nonnull idempotent e’ in the 0-minimal ideal of N = y(A*). Since 
yp(F(X)), e(F(X)) are ideals (which are nonnull because they meet yp(X*), y(X*)), 
both e € yp(F(X)),e’ € v(F(X)). By the preceding proposition, Me ~ Ne. Thus 
G(X) ~ Ne — {0} which is the Suschkewitch group of y(A*). m 
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EXAMPLE 5.1 Let G be a transitive permutation group on a finite set Q, and let H be 
the subgroup of G stabilizing an element q of Q. Let y be a morphism from A* onto 
G, and let X be the (group) code generating X* = y-!(H). The group G(X) then is 
equivalent to G and d(X) is the number of elements in Q. 

In particular, we have for all n > 1, 


G(A”) = Z/nZ, d(A”) =n. 


6 Interpretations 


Proposition By shows that the group of a very thin code and consequently also its 
degree, are independent of the automaton chosen. Thus we may expect that the degree 
reflects some combinatorial property of the code. This is indeed the fact, as we will see 
now. 

Let X be a very thin code over A. An interpretation of a word w € A* (with respect 
to X) is a triple 

(d, x, g) 
with 
dE A xX, x E X*, gEXA 


and 
w = dzg. 


We denote by I(w) the set of interpretations of w. Two interpretations (d, x, g) and 
(d',x',g') of w are adjacent or meet if there exist y, z,y’, z’ E X* such that 


r=yz, waye;  dy=dďy,  zg=zg 


(see Fig. br). Two interpretations which do not meet are called disjoint. A set 
A C I(w) is disjoint if its elements are pairwise disjoint. 
Let w € A*. The degree of w with respect to X is the nonnegative number 


dx(w) = max{Card(A) | A c I(w), A disjoint}. 


Thus 6x (w) is the maximal number of pairwise disjoint interpretations of w. Note that 
for w € F(X), 
ôx (uwv) < ôx (w). 


Figure 6.23: Two adjacent interpretations 


Indeed, since w is not a factor of a word in X, every interpretation of uwv gives rise 
to an interpretation of w, and disjoint interpretations of uwv have their restriction to w 


Version 22 janvier 2008 J. Berstel, D. Perrin and C. Reutenauer 


6. Interpretations 307 


also disjoint. Observe also that this inequality does not hold in general if w € F(X). In 
particular, a word in F(X) may have no interpretation at all, whereas ôx (w) is always 
at least equal to 1, for w € F(X) N X*. 


PROPOSITION 6.1 Let X be a very thin code. Then 
d(X) = min{dx(w) | w € X* Nn F(X)}. 


Proof. Let Ap(X) = (P,1,1) be the flower automaton of X, with the shorthand nota- 
tion 1 instead of (1,1) for the initial and final state. Let M = yp(A%*), let J be the 
0-minimal ideal of M, let e be an idempotent in yp(X*)N J and let S = Fix(e). Then 
by definition d(X) = Card(S), 5 

According to Proposition pi 8 have yp (e)N F(X) #0. Take a fixed word x € 
yp (e)n F(X). Then x € X* N F(X), since e € pp(X*). 

Let w € X* N F(X) and let us verify that d(X) < x(w). For this, it suffices 
to show that d(X) < 6x(xwax), because of the inequality ôx (xwx) < dx(w). Now 
pp(«wx) € H(e), and consequently its restriction to S is a permutation on S. Thus 
for each s € S, there exists one and only one s’ € S such that (s, yp(«wz), s’) = 1, or 
equivalently such that 


twr sy 
— S 


Since w € F(X), this path is not simple. Setting s = (u, d), s’ = (g, v) it factorizes into 


d Yy 


g 
s > 1 >] > 3! 


and (d,y,g) is an interpretation of xwx. Thus each path from a state in Ș to another 
state in S, labeled by xwa, gives an interpretation of rwx. Two such interpretations 
are disjoint. Assume indeed the contrary. Then there are two interpretations (d1, yi, 91) 
and (d2,y2,92) derived from paths sı = si and s2 = s that are adjacent. This 
means that the paths factorize into 


gı 1 
1— s}, 


g2 1 
1 > 1 ? S9 


with d1z1 = dz, and also zg, = z392. Then there is also, in A} (X), a path 


1 
Za 


2 
> 1 ys, 


labeled xwx. This implies (s1, pp(xwa), s3) = 1; since s4 E€ S, one has s = s}, whence 
59 = Sj. 
Thus the mapping associating to each fixed point an interpretation produces a set of 
pairwise disjoint interpretations. Consequently Card(S) < ôx (rw). 
We now show that 
5x (a*) < d(X), 


where x is the word in on (e)N F(X) fixed above. This will imply the proposition. 
Let (d,y,g) be an interpretation of z3. Let p = (u, d), q = (g,v) € P. Then there is 
a unique path 


p | Yj l, q, (6.6.1) 
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and moreover the paths p as 1,1 B q are imple or null. Since yp(x) = e, there 
exists a unique s € S such that the oo also factorizes into 


P = >q 


Since x € F(X), the word d is a prefix of x and g is a suffix of X. 
Thus there exist words z, Z € A* such that 


yY = 222, dz = £ = Zg. 


Observe that the fixed point S associated to the interpretation is independent of 
the endpoints of the path ( = Consider indeed another path 


p' ti Ya I, 4 
associated to the interpretation (d,y,g), and a fixed point s’ € S such that p -= 


x x A = * A d z £ 
s — s' — q'. Since x = dz = Zg, the above path factorizes in pf — 1 —> s’ —> 


s! Š 1 -b q'. The uniqueness of the path 1 4 1 forces s = s’. 
Thus we have associated, to each interpretation (d, y, g), a fixed point s € S, which 
in turn determines two words z, Z such that y = zxz, and 


1 > S > S >]. 


We now show that the fixed points associated to distinct interpretations are distinct. 
This will imply that ôx (x?) < Card(S) = d(X) and will complete the proof. 

Let (d',y',g') be another interpretation of x°, let p' = (u’,d’), q! = (g',v') € P, and 
assume that the path 


decomposes into 


p N Ea > S Z1 bg (6.6.2) 


Since a € F(X), the path s > s is not simple. Thus there exist h, h € A* such that 


x= hh and 7 
h h 
s—1—s. 


ut 3 
The paths Eana (60-4 become 


p— 1— s— |1 pe, 1g 


d! z! h h z! ' 
p — 1 > § > J > § > Pag, 


This shows that zh, hz, z'h, hz! € X*. Next dz = d'z! = x. Thus dzh = ď'z'h, showing 
that the interpretations (d, y, g) and (d',y',g') are adjacent. The proof is complete. 


Now we are able to make the connection with the concept of degree of bifix codes 
introduced in the previous chapter. If X C At isa w n pe code, then two 


adjacent interpretations of a word w € Gy are equal Lae hus 6x(w) is the number of 
interpretations of w. As we have see hapter ins number is ae on H(X), 
whence on F'(X). Thus Proposition = that the two notions of degree we have 


defined are identical. 
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7 Exercises 


. ction4.2 
Section 


1.1 Let X be a subset of At and let A} (X) = (P, (1, 1), (1, 1)) be the flower automa- 
ton of X. Let y be the associated representation. Show that for all 


(p,q), (r,s) € P and we A* 


we have 
((p, 4), p(w), (7, 8)) = (q(X.)*r, w) + (pw, r)(q, ws). 


1.2 Let A = (P,i,T) and B = (Q,j, S) be two automata, and let p: P —> Q bea 
reduction from A on B such that i = p~!(j). Show that if A is deterministic, then so 
is B. 


. section4.3 
Section 


2.1 The aim of this problem is to prove that for any stable submonoid N of a monoid 
M, there exists a morphism y from M onto a monoid of unambiguous relations over 
some set Q and an element 1 € Q such that 


N={meM | 1y(m)1} 


For this let 
D={(uv)EMxM|uweEN} 


Let p be the relation over D defined by 
(u,v)p(u',v) if and only if Nun Nu’ Ad and vN NVN £0. 


Show that the equivalence classes of the transitive closure p* of p are cartesian products 
of subsets of M. 

Show that N x N is a class of p*. 

Let Q be the set of classes of p* and let 1 denote the class N x N. Let y be the 
function from M into P(Q x Q) defined by 


(U x V)g(m)(U' x V’) & Um C U’ and mV’ C V. 
Show that y is a morphism and that 
N = {mE M | 1y(m)1} 


Show that in the case where M = A*, the construction above coincides with the 
construction of the flower automaton. 


2.2 Let K be a field and let m be an n x n matrix with elements in K. Show that 
m = m? if and only if there exist 1 € K”*P and r € K?*” such that 


m= lr and rl = Jp, 


where J, denotes the identity matrix. 
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2.3 Let M be an unambiguous monoid of relations over a set Q. Let D be a D-class 
of M containing an idempotent e. Let R (resp. L) be the R-class (resp. the L-class) 
of e and let A (resp. T) be the set of its H-classes. Let (ay, a4,;)Hea be a system of 
coordinates of R, and let (bx ,0;,)Ker be a system of coordinates of L. Let e = Ir be 
the column-row decomposition of e and set ry = ray,IlK = bgl. 

The sandwich matrix of D (with respect to these systems of coordinates) is defined 
as the A x IT matrix with elements in Ge U0 given by 


rylx if eanbre € H(e), 
SHK = ; 
0 otherwise 


Show that for all m € M, H €A, K ET, 
(H x m)SwKg = Suk (mx« K) 


with H'=H-m, K'=m.-K. 
Show that D is isomorphic with the semigroup formed by the triples (H,g, K) ET x 
Ge x A with the product defined by 


(K, g, H)(K',g', H') = (K, gSnx'g', H’). (6.7.1) 


2.4 Let A = (P,1,1) and B = (Q,1,1) be two unambiguous trim automata. A 
reduction p from A to B is said to be unambiguous if there is a pair (A, u) of partial 
functions from P to Q which are restrictions of p and such that for each path q > q' 
in B there exists a unique pair p € \~'(q) and p' € !(q’) such that p & p' is a path 
in A. Such a pair (A, u) is called an unambiguous realization of p. 

(a) Verify that the functio 3, # given below form an unambiguous realization of the 
reduction p of Example k 4 


Hint: show that there exists an invertible matrix R such that 
R=|L|, R'=|W M X] 


where L is the matrix of the relation A7} and M is the matrix of the relation u with 


0 0 0 
Rgplc) R = |* (c) 0 
* 0 


x 


for each letter c = a, b. 
(b) Show that if the monoid y4 has finite minimal rank, then any reduction from A to 
B is unambiguous. 
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i section4.4 
Section 


3.1 Let K be a semiring and let m be a K-relation between P and Q. The rank over 
K of m is the minimum of the cardinalities of the sets R such that 


m= lr 
for some K-relations 1 € K?*®, na Denote it by rankg(m). The rank of 
a relation, as defined in Section Fe is therefore also its rank when considered as an 


N-relation. 
Show that if K is a field and Q is finite, the rank over K coincides with the usual 
notion of rank in linear algebra. 
1 1 
1 0 


0 
7 1 
m=]oð11o0 
0011 
m) 


= 3. 


3.2 Let 


Show that ranky(m) = 4, but that rankz( 


3.3 Let M be an unambiguous monoid of relations over Q, that is transitive and has 
finite minimal rank. Let 1 € Q and N = Stab(1). 

Let A (resp. T) be the set of 0-minimal or minimal left (resp. right) ideals of M, 
according to M contains or does not contain a zero. 

Let R, R' €T, L, L’ € A. Show that if 


RNLAON#O and ROLAND, 


then also 
ROL'nN#@ and RNALANZD. 


In other words, the set of pairs (R, L) € I x A such that RN LA N Æ f is a Cartesian 
product. 


3.4 Let M be a transitive unambiguous monoid of relations on Q which has finite 
minimal rank and which does not contain the null relation. Let U be the set of nonzero 
rows of the elements of M. Show that the following conditions are equivalent for v € U. 

(i) v is a row of an element of M of minimal rank. 
(ii) OZ uM. 
(iii) v is maximal among the rows of the elements of M. 
(iv) v is a row of an element of M with a minimal number of nonzero distinct rows. 


3.5 Let X be a thin maximal code and let A = (Q,1,1) be a trim unambiguous 
automaton recognizing X*. Let y be the associated representation and let M = y(A*). 
(a) Show that a word w is strongly right completable if and only if 0 ¢ y(w)ix M. 

(b) Let K be the minimal ideal of M. Show that any word w € y~!(M) is simplifying 
and strongly right completable. 
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3.6 Let M be an unambiguous monoid of relations on a finite set Q. Let L (resp. C) 
be the set of rows (resp. columns) of the elements of M. Show that for each r € L, 
méeMandleC 

rml < 1. 


Conversely, let L (resp. C) be a set of row (resp. comlumn) vectors in P(Q) such that 
L={re2?|rl<1foralllec}, C={le2¢|rl<1 forallreL}. (6.7.2) 


Let 
M ={me€ 28% | rml < 1 for all r € L and 1€ C}. 


(i) Show that M is a transitive monoid of unambiguous relations on Q which con- 
tains all products lr for r € L and l€ ©. 

(ii) Show that any transitive unambiguous monoid of relations is a submonoid of 
one obtained in this way. 


3.7 Let M be a transitive unambiguous monoid of relations on a finite set Q not 
containing the relation 0. Let L (resp. C) be the set of rows (resp. columns) of the 
elements of M which are maximal. Let U be the set of sums of the distinct rows of the 
elements of minimal rank of M and let V =C. 

Show that for each u € U, m € M and v E€ V 


umv = 1. 
Conversely, let U (resp. V) be a set of row (resp. column) vectors such that 
U = {uc 2 |w =1 for allue V}, V= {ve 2? | uw =1 forallueU} (6.7.3) 


and such that for all p E€ Q there is a u € U (resp. v € V) such that up = 1 (resp. 
Up = 1). Let 


M = {me 2°%2 | umv = 1 for all u € U and v € V}. 


(i) Show that M is a transitive unambiguous monoid of relations on Q not contain- 
ing 0. 

(ii) Show that any transitive unambiguous monoid of relations not containing 0 is a 
submonoid of one obtained in this way. 


3.8 An unambiguous monoid of relations on a finite set Q with n elements is said to 
be very transitive if it contains a transitive group G of permutations on Q. The aim of 
this exercise is to show that all elements of a very transitive unambiguous monoid of 
relations have the same number n of elements (as subsets of Q x Q). 

Let, e be an idempotent of minimal rank. Let u be the sum of the distinct rows of 
e and let v be a column of e. Let r = Card(u) and s = Card(v). Let U = uG be the 
orbit of u under the right action of G and let V = Gv be the orbit of v under the left 
action of G. Let p = Card(U) and q = Card(V). 
(i) Show that for each q € Q, the number of elements of U containing q is inde- 
pendent of q. Let h be this integer. In the same way, let k be the number of 
elements of V containing a given q € Q. 
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(ii) Show that 
rp=hn, sq=hn 


and 
rk=p, sh=q. 


(iii) Show that for each m € M, pq = thk where t is the cardinality of m (as a subset 
of Q x Q). Conclude that t = n. 


3.9 Show that for any transitive unambiguous monoid of relations M on a finite set 
Q, there is a finite set R containing Q and a transitive monoid of unambiguous relations 
N on R not containing 0 such that the elements of M are subsets of the restriction to 
Q x Q of elements of N. 


3.10 Let G be a graph. A clique in G is a set of vertices such all pairs are edges of G. 


A set of vertices is stable if no pair of elements is an edge. Consider the set L of cliques 

{ A and the set R of stable sets. Show that the pair (L, R) satisfies the equalities 
of Exercise 

Let U (resp. V) be the set of maximal cliques (resp. stable sets). Show that if the 


graph G has the property that any peepee clique i tersects any maximal stable set, 
then (U,V) satisfies the relations (7a) ore xercise F A 


3.11 Let M be an unambiguous transitive monoid of relations not contaning zero. 
Show that for two elements m, m’ of M, if m < m’ then m = m. 


3.12 Prove that if A is an n-state transitive complete synchronized unambiguous 
automaton, there is a word of length at most (n? — n + 2)(n — 1)/2 such that y,4(w) 
has rank one. 

Hint: Prove that for a state p € Q and a word u € A*, if p(u)px is not a maximal row, 
there is a state q and a word v of length at most n(n—1)/2 such that p(u)p. < p(vu)qs. 


. section4.5 
Section 


4.1 Let X C A* be a very thin code. Let M be the syntactic monoid of X* and let 
vy be the canonical morphism from A* onto M. Show that M has a unique 0-minimal 
or minimal ideal J, according to M contains a zero or not. Show that y(X*) meets J, 
that J is a D-class, and that each H-class contained in J and which meets (X *) is a 
finite group. 


4.2 Let X C At be a very thin code, let A = (Q,1,1) be an unambiguous trim 
automaton recognizing X*. Let y be the associated morphism and M = y(A*). Let J 
be the minimal or 0-minimal ideal of M and K = J — 0. Let e € M be an idempotent 
of minimal rank, let R be its R-class and L be its £-class. Let A (resp. T) be the set 
of H-classes contained in R (resp. L), and choose two systems of coordinates 


(aH, ay) HEA, (bx, 0K) Ker 
of R and L, respectively. Let 
u: M => (Ge 0 
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be the morphism of M into the monoid of row-monomial A x A-matrices with elements 
in Ge UO defined by the ?-representation with respect to e. Similarly, let 


v: M =(G,u0s"™ 


be the morphism associated with the £-representation with respect to e. Let S be 
the andwich matrix of J relative to the systems of coordinates introduced (see Exer- 
cise Bai Stow that for all m € M, 


p(m)S = Sv(m) 
Show that for all m,n € M, 


u(m)= u(n) & (VH €A,rym=rgn), 
v(m) =v(n) & (VK ET,mlçg =nlçg), 


where ry = lag, lx = bgl and lr is the column-row decomposition of e. 
Show, using these relations, that the function 


me (u(m),v(m)) 
is injective. 


4.3 Let X C At bea very thin code. Let y be the representation associated with 
an unambiguous trim automaton A recognizing X*, let M = y(A*) and let J be its 
minimal ideal. 

Show that X is prefix if and only if for any idempotent e in J not in y(X*) one has 
Men y(X*) = 9. 


. section4.6 
Section 


5.1 Let X be a very thin code. Let M be the sy tactic monoid of X*, and let J be 
the 0-minimal or minimal ideal of M (see Exercise nied G be an H-class in J that 
meets y(X*), and let H = GN y(X*). 

Show that the representation of G over the right cosets of H is injective, and that 
the permutation group obtained is equivalent to G(X). 


5.2 Let X be a thin maximal code. Show that if z € A* is both strongly right and 
strongly left completable, then some power of z is in X*. 


5.3 Let X,Y be two codes. We define the meet of X and Y, denoted X A Y as the 
basis of the submonoid X* N Y*. Show that the meet of two thin codes X,Y C A*t is 
thin maximal over A if and only if there is a word x € X* strongly left completable 
in Y* and a word y € Y* strongly right completable in X* (a word x is strongly left 
completable if for any u € A* the word uy is left completable). 


5.4 Show that for any rational (resp. thin) code Z, there exist two rational (resp. 
thin) maximal codes X,Y such that Z = X AY. 
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8 Notes 
Unambiguous automata and their relation to codes appear in bchtitzenberger| (L965b)). 


Unambiguous automata are a special case of automata with multiplicities. The latter 
are the K-automata which are extensively studied in (L974) and 
(1984). Kleene’s theory for these automata goes back to 
(1961c). A significant step in the study of monoids of unambiguous relations using such 
tools as the column-row decomposition appears in (L974). 

An automaton with output labels such as the decoding automaton of Section 2 is 
called a transducer (see ((L974))). For a finite code, this transducer is finite 
and unambiguous. This is a particular case of a more general result from the theory 
of transducers: any function realizable by a finite transducer can also be realized by 


a finite unambiguous transducer (Elgot and Mezei (L965). Firat another proof of as 
result, see [Arnold and Latteux} (1981). On this subject, see Bilenberg] (1974), Berstel 


(£979), and (L983). 


The study of the structure of ie Pines in monoids of unambiguous relations 
is very close to the classical development for abstract semigroups presented in the 
usual textbooks. In particular, the R-and £- eS are the right and left 


Schiitzenberger representations of \Clifford and Preston (L961) and (L979). 


The generalization of the results of Section Z O a of (Boolean) relations is 
partly possible. See, for nn Poros aTa] (L980). The notion of rank and the 
corresponding results appear in (L979) for the particular case of monoids of 
functions. 
Theorem 4.1} 1s due to Schiitzenberger. extension to sets which are not codes 
appears in ee ({L979b). Problem E-Tjis a theorem due to Cass 
2-3) 


Extensions may be found in (1976). The notion of sandwich matrix (Exercise 
nesMax exomrnąkxo-cliques 


is a classicąl Jone. 

Exercise B iC Bsa 1s due to Césari] (1974). Exercises B-G to B.Id are due to [Bod (991). 
Exercise 8.8] gives an alternative re a result of [Perrin and Schiitzenberger| (L976) 
(see Exercise VIII.6.3). Exercise Fe sated with the embedding of codes into maxi- 
mal ones, although it does not provide an alternative to prove that every rational code 
is included in a maximal one (the relations corresponding to the letters may generate 
a monoid which is not transitive). Cliques in graphs play an important role in many 
problems of graph theory. The graphs having the property that any maximal clique 
meets any payimal stable set have a recently characterized (p006). 

Exercise pepe from (L987). The notion of unambiguous reduction has some 


connexion with the reduction of li gar representation of Exe serie »Berstel andl Berstel and} 
({L988))). Exercise per iom Béal L al] (2007). Exercise 7 from |Carpil 
(1988). A simplified proof appears in em, It shows that for transitive 


complete synchronized unambiguous automata, vier is a cubic upper bound for the 
length of a synchro izing word, as it is the case for synchronized deterministic automata 
(see Exercise ~As for deterministic automata, the optimal upper bound is not 
known. 


exo- teow iexbLattice2 


4 are from |Bruyèrel 
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Chapter VII 


Synchronization 


0 Introduction 


The notion of synchronization for codes and auutomata refers to the ability of parsing 
an input into code words with a limited amount of information. It adresses a more 
general situation than deciphering which is left-to-right oriented. The interest of syn- 
chronization lies in the possibility of recovering from errors, and by the local aspect of 
the tools involved. 

The chapter starts with the definiton of synchronizing pairs, and synchronizing words 
and of absorbing words. Next, two notions of finite deciphering delay are introduced, the 
first related to the number of words involved, the seco connected to local automata. 
These notions have already be consi red in Chapter j for prefix codes, and also in 
the case of circular codes in Chapter Re 

In this chapter, the main results concern the completion of rational uniformly syn- 
chronized codes and of locally parsable codes. 


1 Synchronizing pairs 
Let X C At be a code. A pair (x,y) of words of A* is synchronizing if for any words 


u,v € A*, one has 
uzyv E€ X* > uxz,yv € X*. 


A word x € A* is synchronizing if 


Figure 7.1: A synchronizing pair. 


uzv E€ X* => ux,rv € X*. 
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Observe that if X is a prefix code, then x € A* is synchronizing if 
uzv E€ X* => ux € X*. 
apter2 
This is precisely the definition given in Chapter i i 


PROPOSITION 1.1 Let X C At be a code. If x,y € A* are synchronizing words, then 
the pair (x,y) is synchronizing. Conversely, if (x,y) is a synchronizing pair of words 
of X*, then xy is a synchronizing word. 


Proof Let x,y be a synchronizing xords. If uxyv € X*, then ua € X* because x is 
synchronizing, and yv € X* because y is synchronizing. Thus (x,y) is a synchronizing 
pair. Conversely, if (x,y) is a synchronizing pair, uxyv € X* implies ux, yu € X* and 
thus uzy, xyv € X*. Thus zy is a synchronizing word. n 


A code X is called synchronized if there exist synchronizing pairs of words of X*. In 
view of the eceding proposition, this terminology is compatible with that introduced 
in Chapter Ht existence of a synchronizing pair (x,y) has the following meaning. 
When we try to decode a word w € A*, the occurrence of a factor xy in w implies that 
the factorization of w into words in X, whenever it exists, must pass between x and y: 
if w = uxyv, it suffices to decode separately ux and yv. 

The following is a characterization of synchronized codes. 


PROPOSITION 1.2 Let X be a very thin code over A. Then X is synchronized if and 
only if d(X) =1 


Proof. Let X be a synchronized code, and let x * A Pe fx ) be a word such that 
ôx(x) = 1. Such a word exists by Proposition i syed We show that (x,x) is a 
synchronizing pair. Let indeed u,v € A* be such that uxrv € X*. The interpretation 
(1, uxzzv, 1) induces two interpretations of x, the first on the first occurrence, the second 
on the second occurrence of x in uxxv. Denote them by 


ud y gd! y g'vx 


Figure 7.2: Two interpretations of x 


(d,y,9), (d',y', 9’) 


with ud € X*,g'v € X*, and gd’ € 1U X. Since (d,y,g) and (1,2,1) are adjacent 
interpretations (degree is one), there exist £1, £2, Y1, Y2 E X* such that 


£ = T172, Y = Y1Y2, zı = dy, LQ = yog. 
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Consequently, ux = uxız2 = (ud)yixq E€ X*. Similarly zv € X*. 

Conversely, let x,y € X* be words satisfying the implication. Then ry € F(X). 
Otherwise there would exist u,v € A* such that uxyv € X; but then uz,yv € X*, 
contradicting the fact that X is a code. Next consider an interpretation (d,z,g) of 
xy. Then ud,guv E€ X for convenient u,v € A*. Thus uxyv € X*, hence ua € X* and 
yv E€ X*. This shows that the interpretations (d, z,g) and (1, ry, 1) are adjacent. Thus, 
dx(xy) = 1. n 


A pair (x,y) of words of X* is absorbing if A*x N yA* C X*. A code X which has 
an absorbing pair is complete since for any word w, one has ywg € X*. 


EXAMPLE 1.1 Consider the suffix code X = ab* over A = {a,b}. Observe that X* = 
aA* and that X contains the code of the previous example. Every word in X is 
synchronizing. Every pair of words of X is absorbing. 


PROPOSITION 1.3 Let X C At be a code. Any absorbing pair is synchronizing. Con- 
versely, if X is complete, then any synchronizing pair is absorbing. 


Proof. Let (x,y) be an absorbing pair. Let u,v € A* be such that uxyv € X*. Then 
w = yuryve is in X*. Since w = (yux)(yvx) = y(uryvax), and y, yuz, uxyvx, yvx are 
in X*, it follows by stability that uz € X*. Similarly yv € X*. 

Conversely, let (x, y) be a synchronizing pair and let w € A*aNyA*. Thus w = ux = 
yv for some words u,v € A*. Since X is complete, there exist words u’,v’ € A* such 
that u'xwyv’ € X*. Since (x,y) is synchronizing, we have u'z, u'xw, wy’, yo’ E€ X* by 
synchronization. By stability, this implies w € X*. n 


As a consequence, we have the following characterizaion of complete synchronizing 
codes. 


PROPOSITION 1.4 Let X C At be a very thin code. Then X is complete and synchro- 
nized if and only if there exist absorbing pairs. 


EXAMPLE 1.2 The code X = {aa, ba, baa, bb, bba} is synchronized. Indeed, the pair 
(aa, ba) is an example of a synchronizing pair: assume that uaabav € X* for some 
u,v E€ A*. Since ab ¢ F(X), we have uaa, bav € X*. Since X is also a complete code, 
it follows by Proposition at (aa, bb) is absorbing. Thus 


baA*aa C X*. 


2 Uniformly synchronized codes 


A code X C At has verbal synchronization delay s if any x € X° is a synchronizing 
word. For simplicity we talk of the synchronization delay, when no confusion arises. 
The notion yerba] synchronization delay has been called simply synchronization delay 
in Chapter {M}. 
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apter7 : . i . i i 
Recall from Chapter iy that a code is uniformly synchronized if there is an integer s 
such that X has synchronization delay s. The least s of this kind is called the minimal 
synchronization delay of X. 


EXAMPLE 2.1 Consider over A = {a,b} the code X = {a,ab}. Every word in X is 
synchronizing. Therefore X has synchronizing delay 1. Consequently, every pair of 
words of X is synchronizing. 


The following result shows that a code with finite synchronization delay has also 
finite deciphering delay. More precisely 


PROPOSITION 2.1 The minimal deciphering delay of a code is at most equal to its 
minimal synchronization delay. 


Proof Let s be the minimal synchronization delay of X. Let x € X*, y € X° and u € A* 
be such that vyu € X*. Since X has synchronization delay s, we have xy, yu € X*. 
Thus y is simplifying. This shows that X has deciphering delay s. n 


The following example shows that the minimal deciphering delay may be finite but 
not the synchronization delay. 


EXAMPLE 2.2 Let X = {ab,ba}. Since X is prefix it has deciphering delay 0. It has 
infinite synchronization delay since for each n > 1, the word x = (ab)” satifies bra € X* 
although bz, xa ¢ X*. 


ior wn iter 
We prove now an analogue of Theor Jor Eat ior uniformly synchronized codes. The 


construction of the proof of Theorem Sater be use since jt does not even preserve 
the finiteness of the deciphering delay (see Example 


orem-RCFDD 
The following example shows that the construction sf the proof of Theorem ES 


neither applies. 


EXAMPLE 2.3 Consider again the code Sie layer A= {a, b} which has synchro- . em-RCFDD 
nizing delay 1. We have seen in Example at the construction used in Theorem 


gives the code Y = {a,ab} U {ab’,ab?a}H bb, ba}* which has deciphering delay 1. How- 
ever, Y has infinite synchronization delay since every (ab)” is a factor of ab(ba)"*! 
which is in Y, and thus no pair (ab)*, (ab)f is synchronizing. 


THEOREM 2.2 Any rational uniformly synchronized code is contained in a complete 
rational code with the same minimal synchronization delay. 


Proof. Consider a nonempty code X C A* with synchronization delay s and consider 
M = (X° A* N A* X°) U X*. (7.2.1) 


Observe that M is a submonoid of A*. Let Y be the minimal generating set of M. We 
show that Y is a code having the desired properties. The proof is in several steps. 

Let us first prove that Y is code. For this, we prove that M is stable. Let u, w,v € A* 
be such that u, uw, wv, v € Y*. We prove by induction on |wwv| that w € Y*. It is true 
for |uwv| = 0. Suppose that it is true for any such triple u’, w’, v’ with |u'wv'| < [uwv]. 
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Figure 7.3: Proving that M is stable. 


We consider several cases. 
Case 1. Suppose that u ¢ X* (the case v ¢ X* is symmetric). Then in particular 
u € AX" and thus u = tz with t € At and z € X5. We distinguish two cases. 

(i) If uw € X*, then, since uw = tzw, we have tzw € X*. Since z is synchronizing, 
we have u = tz € X*, a contradiction. 

(ii) If ww ¢ X*, then in particular uw € At X%. Thus uw = tz’ with t € At and 
z! € X*. Suppose first that |zw| > |z’|. Then zw € zA* N A*z’ and thus zw € Y*. 
We may thus apply the induction hypothesis to the triple (z,w,v). Otherwise, we have 
|zw| < |z’| and thus z’ = rzw for some r € A*. Then rzw € X* implies that rz € X*. 
We may thus apply the induction hypothesis to the triple (rz, w, v). 

Case 2. We have now u,v € X*. Suppose that wv ¢ X* (the case uw ¢ X* is 
symmetric). Then wv = zt with z € X* and t € A+. But uwv is in X* and uwv = uzt 
implies zt € X*, a contradiction. 

Case 3. Finally, if u, uw, wv,v € X*, then w € X* since X is a code. This proves that 
Y is a code. 

We now prove that X C Y. Let indeed x € X. Suppose that x = yy’ for two 
nonempty words of M. Then y or y’ is not in X*. We may suppose for instance that 
y’ g X*. Then y’ € X*A* and thus y’ = zu with z € X° and u € A*. Since z is 
synchronizing and yzu € X, we have y’ = zu € X*, a contradiction. Thus x ¢ (Y* \ €)? 
and thus z E€ Y. 

Next we show that Y is complete and has synchronization delay s. For this, we first 
prove that 

Y° c X°A*NA*XS. (7:2:2) 


Let indeed y = yiy2--- Ys with Y1, Y2;...-, Ys E Y. If all y; are in X, the conclusion is 
true. Otherwise let i be the least index such that y; X. Then y; E€ X*A* and since 
Yi,- --;Yi-1 E X, we obtain y € X*A*. The proof that y € A*X® is symmetric. 
Consider now y € Y*. Then by (AH for any u € A*, the word yuy starts and ends 
with a word in X*, and thus is in Y*. This shows that Y is complete. 
To show that Y has synchronization delay s, suppose that uyv € Y* for some u,v € 
A* and y € Y°. Let us prove that uy, yu € Y*. We only prove that uy € Y*, the same 


reasonjng holds for yv. 

By por has a suffix in X*. Thus uy has a suffix in X*. Let y = tz with t € A* 
and z € X5. 

Since uyv € Y*, either uyv € X* or uyv has a prefix in X®. If uyv € X*, the since 
z is synchronizing, we have utz = uy € X* and hence also uy € Y* (see Figure jae 

Otherwise, uyv has a prefix x in X®. If x is a prefix of uy, then uy € X*A*NA*X* and 
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thus uy € Y*. Otherwise, uy is a prefix of x. Since z is synchronizing, utz = uy E€ X*. 
Thus again uy € Y*. 


Figure 7.4: Proving that y = tz is synchronizing 


EXAMPLE 2.4 Consider again the code X = {q.q ; With synchronization delay 1 on 
the alphabet A = {a,b}. The set M defined by is M = aA* N A*X and the base 
of M is 

Y = (abb? )* X 


Indeed, the words of Y are exactly the words starting with a, ending with a or ab and 
such that the number of occurences of b between two a is at least 2. 


3 Locally parsable codes and local automata 


section4bis.2 


For L C A*, denote by Cz(w) the set of contexts of a word w in L, that is 
Cr(w) = {(u,v) € A* x A* | uwv € L}. 


A word w € A* is said to be a constant for L if for any (u,v), (u’,v’) € Cz (w) one has 
also (u, v’), (u’,v) € Cr(w). 
The following statement shows that the set of constants for L forms a two-sided ideal. 


PROPOSITION 3.1 If w € A* is a constant, then for all u,v € A*, the word uwv is a 
constant. 


Proof Let p,p’,s,s’ € A* be such that (p,s),(p',s’) € C(uwv). Then (pu,vs) and 
(p'u, vs’) are in C(w). Since w is a constant, we have also (pu, vs’), (p'u, vs) € C(w). 
Thus (p, s’), (p’, s") € C(uwv). Thus uwv is a constant. / 


We will consider constants for L = X* where X c At is a code. 


st-constSynch} PROPOSITION 3.2 If a word of X* is a constant for X*, then it is synchronizing. If 
y 


(x,y) E€ X* x X* is a synchronizing pair, then xy is a constant. 


Proof Let x € X* be a constant. If uzv is in X*, then since (e€,€) € Cx»(x), we have 
also uz, xv € X*. Thus x is synchronizing. 

Let (x,y) € X* x X* be a synchronizing pair. Suppose that (u,v), (u’,v’) € C((£y). 
Then uz, yu, u’x, yv’ are in X* and thus (u,v’), (u,v) E€ C(xy), showing that ry is a 
constant. m 
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A code has literal synchronization delay s if any word of A® is a constant for X*. A 
code is locally parsable if there is an integer s such that it has literal synchornization 
delay s. 


EXAMPLE 3.1 The code X = {a,aab} has literal synchronization delay 2. Indeed 
C(aa) = X* x (e+ b)X* and C(b) = X*aa x X*. 


EXAMPLE 3.2 The prefix code X = {ba, ca, aba, cba, aca, acba, aaca} is the Franaszek 


code. It has sync ronization, delay 4. Indeed, the minimal automaton of X* is repre- 
. aszekAutomaton N . 
sented on Figure f/7.5, One may verify that any word of length 4 is a constant. There is 


Figure 7.5: The minimal automaton of the Franaszek code. 


actually a unique word of length 3 which is not a constant, namely aac. The two-sided 
ideal of constants is generated by the finite set {aaa, b, ca, cc}. 


Let X be code with literal synchronization delay s. Let P = X*A™~ and S = A7 X*. 
It is a consequence of the definition that for any u,v,w € A* such that www € X* and 
|u| > s, we have 

v € P > vwe X* (7.3.1) 


Indeed, since v € P, there is a z € A* such that vz € X*. Then (e€, z), (u, w) € Cx» (v) 
implies (e, w) € Cx«(v). Similarly 


v E S => w E X* (7.3.2) 


FranaszekAutomaton 


synchGauche 


g . . o. nchroDeciph 
The following statement is the counterpart for the literal delay of Proposition 2 i 


PROPOSITION 3.3 The minimal literal deciphering delay of a code is at most equal to 
its literal synchronization delay. 


Proof Let x € X* „adeft completable word of length s and u € A* be such that 
xyu € X*. By Pike yu E€ X*. Thus y is simplifying. This shows that X has 
deciphering delay at most s. 7 


PROPOSITION 3.4 A locally parsable code is uniformly synchronized. The converse is 
true if the code is finite. 
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Proof Let X C At be a code with literal synchronization delay s. Then any word of X$ 
is of length at least s and is therefore a constant. Thus X has verbal synchronization 
delay s. 

Conversely, suppose that X C A* is a finite code with verbal syncronization delay s. 
Let £ be the maximal length of the words of X. Let w be a word of length 2¢(s +1). If 
w is not completable, then it is a constant. Otherwise, there are words £1, 2%2,...,2n 
in X such that w = x214%2---X%,. We may suppose that x22---2%p_1 is a factor of w. 
Then |w] < né implies As +1) <norn—2> 2s. Thus z2...£n—2 is a constant by 
Proposition Bd This mplies that w is a constant. Thus X has literal synchronization 
delay d. n 


A set Y C A* is said to be stricly locally testable if it is of the form 
Y = T U (U A* N A* V) \ AAWA* (7.3.3) 
where TU, V,W are finite subsets of A*. 


PROPOSITION 3.5 A code X is locally parsable if and only if X* is strictly locally 
testable. 


Proof Suppose first that X has literal synchronization delay s. We may suppose s > 1. 
Let T be the set of words in X* of length less than s. Let U = X*A N A® and 
V = A X* N A5. Finally, let W be the set of words w of length s+ 1 which are not in 
the set F'(X*) of factors of X*. Let us verify that X* = TU (U A* N A*V) \ A*WA*. 
The inclusion from left to right is clear. Conversely, let x be in the set defined by the 
right hand side. If |x| < s, then x € T and thus x € X*. Otherwise, let us first show 
by contradiction that x € F(X*). Suppose that x is not in F(X*). Let v be a factor 
of x of minimal length which is not in F(X*). Since x has no factor in W, we have 
|u| > s +1. Let v = ahb with a,b € A. Then ah,hb € F(X*) imply that there exist 
U1, U2, U3,U4 E Ax such that ujahus, uzhbu4 E X*. But since |ahb| > s +1, h isa 
constant. Thus ujahbu, € X*, a contradiction with the hypothesis v = ahb ¢ F(X*). 
Finally, let u,v € A* be such that uxv € X*. Since x € UA*, we have xv € X*. And 
since x E€ A*V, this implies in turn that x € X*. Thus X* is strictly locally testable. 
Suppose conversely that X* is strictly locally testable. Let T’,U,V,W be finite sets of 
words such that feet holds. Let s be the maximal length of the words of T, U, V, W. 
Let w be a word of length s+1 and let (u,v), (u’,v’) be in C(w). Since [uwv], |u’wo'| > 
s+1, we cannot have uwv € T or u'wv' € T. Thus uwv, u'wv' € U A* N A*V \ A*W A*. 
Since |uw|, |u’w| > s +1, we have uw, u'w € UA* and wv, wv’ € A*V. For the same 
reason uwv’, u'wv ¢ A*W A*. Thus (u,v’) and (u’,v) are in Cw), showing that w is a 
constant. Thus X has literal synchronization delay s. n 


Observe that, as a consequence of the above result, any locally parsable code is 
rational. 


EXAMPLE 3.3, Let X = {a,aab} be the code with literal synchronization delay 2 of 
Example Bef ten 


X* = aa A* \ A*{bb, bab} A*. 
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EXAMPLE 3.4 Let_A = {a,b,c} and let X be the Franaszek code of Example Boa “the 


sets U,V,W of can be chosen as 
U = {aaca, ab, aca, ach, b, ca, cb} 
V = {ba,ca} 


W = {aaaa, aaab, bb, bc, cc} 
with T = @. 
A finite automaton is called (¢,7)-local or simply local if there exist @,r > 0 such 


that for any paths p > q > r and p! 4 q! Š r’ such that |u| = £ and |v| = r, one has 
q = q. The integers £,r are called the memory and anticipation. 


K . . alAutomaton 
EXAMPLE 3.5 Let A be the automaton given in Figure ro It is (1, I)-local. Indeed, 


a 


Figure 7.6: A local automaton. 


any path labeled aa uses state 1 in the middle and there is only one edge labeled b. 


Let £,r > 0 and let n = 4+r +1. The free (£,r)-local automaton is the automaton 
with set of states the words of length + r and edges the triples (x,a, y) such that for 
some W = a1 ':' an E A” 


T = Q1°*'QAn-1; a = Ql+1, Y = A2: An. 


It is clear that this automaton is (£4, r)-local. 
The free (n, 0)-local automaton is usually known as the de Bruijn automaton of order 
n. 


EXAMPLE 3.6 The free (1, 1)-local automaton on the alphabet {a,b} is represented on 
Figure mie label of an edge going out of a state is its second letter. 


The following result shows in particular that a strongly connected local automaton 
is unambiguous. 


PROPOSITION 3.6 Let A be a strongly connected finite automaton on the alphabet A. 
The following conditions are equivalent. 
(i) A is local. 
(ii) A is anambiguous and there exists an integer s such that for any word w of 
length s, the relation pa(w) has rank at most 1. 
(iii) distinct cycles in A have distinct labels. 
Moreover, if A is local, it is unambiguous. 
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Figure 7.7: The free (1, 1)-local automata 


Proof Suppose first that A is (£, r)-local and let s = +r. Let w = wv with u € A‘ and 
v € A". Then there is at most one state q such that p > q > r for some states p,q. 
If the rank of y,(uv) is positive, such a unique q exists and (p,r) E€ yy if and only if 
p= qand q r. This shows that y4(uv) = 1. Thus (ii) holds. 

If (ii) is true, for any word w of length s, the relation p(w) has at most one fixpoint. 
This implies that (iii) is true. 

Suppose finally that (iii) holds. First observe that A is unambiguous. Indeed, since 
A is strongly connected, any path is part of a cycle and thus there can be at most one 
path with given start, end and label. Let n be the panier of states in A. Consider 
paths p > q > r and p S q! S r’ such that |u|, |v v| > n?. Since |u| > i there exists 


f. 


a pair s,s’ which is repeated, that is such that p URAL i q and p' 4, gf > s! 5 q 
with u = hkk’. By condition (iii), we have s = s’. Thus, we have paths p Aat q 
and p’ eg q! with u! = kk’. In the same way there exist paths q > t > r and 


q! >t S r’ for some state t with v = v'w. Since A is unambiguous, the uniqueness of 
the path from s to t with label u’v’ forces q = q’. Thus A is (n?,n”)-local. , 


A word w such that the relation y,4(w) has rank at most one is called a constant for 
the automaton A. The least integer n such that any word of length n is a constant is 
called the order of the automaton. We note that a (A, p)-local automaton has order at 
most A+ p. 

The following result gives a characterization of locally parsable codes in terms of 
automata. It shows in particular that a code X is locally parsable if and only if X* is 
the stabilizer of a state in a local automaton. 


PROPOSITION 3.7 Let A = (Q,1,1) be a finite automaton and let X be the code such 
that A recognizes X*. If A is local, then X is locally parsable. Conversely, for any 
locally parsable code X, there exists a local automaton A = (Q,1,1) recognizing X*. 


Proof Let A = (Q,1,1) be a (4, r)-local automaton and let X be the code such that 
X* is the stabilizer of state 1. Let w be a word of length s = £ +r and let w = hk 


with |h| = £ and ies =r. Let (u, %); (u',v’) € Cx*(w). The paths 1 “S" 1 and 1 a 


decompose in 1 uh p ES 1 and 1 4 b y E ky 1. Since A is (€,7)-local, we have p = p’. Thus 
(u,v), (u’,v) E€ C(w), showing that w is a constant. Thus X has literal synchronization 
delay s. 
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Conversely, let A = (Q,i,T) be the minimal deterministic automaton of X. Let 
A* = (QUw,w,w) be the automaton constructed by the method of Section 0.7. Let us 
show that A* is local. For this consider two cycles p  p and p' S p' with the same 
label w. We will prove that p = p’. Replacing w by some power, we may suppose that 
w is a constant. 

Suppose first that state w does not appear on these cycles. Then these paths are 
paths in A. Let u,v,u’,v’ be such that i > p > t and i LA p' “. t are paths in 
A. We may replace w by a power which is a constant. Since uwv, u'wv' € X, we 
have uwv’,u’wu € X*. Suppose that vu’ = vivi with uwvj, v4 € X*. Then, since w 
is a constant, u’wv', is also in X*. Since X is a code, u’wv’ cannot have a second 
facorization in words of X and thus v5 is empty. Thus shows that uwv’ is in X. In the 
same way, we can show that u’wv € X. Since A is the minimal automaton of X, this 
implies that p = p’. 

Let us now suppose that w appears in one the cycles, say p > p. We have w = uv 
with p > w > p. Let u’,v’ be such that w i p' apa path in A*. Then, since 
|vu| = |w], vu is a constant. Since vu, u’vuvuv’ € X*, we have also u/vuv, vuvv' € X*. 
Thus we have also p! > w > p', which implies p = p’. n 


We now proy the following result, which is the counterp rt, dor Josal ĻRarsable codes 
of Theorem P.Z. e proof is similar to that of Theorem 2-2. 

THEOREM 3.8 Any rational locally parsable code is contained in a complete rational 
code with the same delay. 


Proof Let X be a nonempty rational code with literal synchronization delay s. Let Ps 
be the set of prefixes of length s of the words of X* and let S, be the set of suffixes of 
length s of the words of X*. Let 


M = (P,A*N A*S,) U X* 


Then M is a submonoid. Let Y be the minimal generating set of M. We show that Y 
is a code with the desired properties. Let us first prove that M is stable. For this, let 
u,w,v be such that u, wv, uw,v € M. We distinguish two cases. 
Case 1. Suppose |w| > s. Then uw € M implies that w has a suffix in S, and wv € M 
implies that w has a prefix in P,. Thus w € M. 
Case 2. Suppose |w| < s. We first show that there exists u’ € X* such that u’w € A*S,. 
If u € X*, then, since uw € A*S,, we can take u’ = u. Otherwise, we have u = tr with 
t € A* and r € S,. There exists k € A* such that u’ = kr is in X*. Since |r| = s, the 
suffix of uw which is in S, is a suffix of rw and we have u'w € A*S,. Symmetrically, 
one can prove that there exists a v’ € X* such that wv’ € P,A*. Let u'w = zt and 
wv = pq with z,q E€ A*, t € S, and p E€ P,. Let h € A* be such that ph € X*. Since 
w is a gfx of p, u'w = zt is a prefix of u'ph. Then, from u'ph € ztA*, we deduce 
by espiaiene uw = zt € X*. Similarly, we have wv’ € X*. Since X* is stable, this 
implies w € X*. Thus M is stable. 

Let us prove that X C Y. Let x € X and suppose that x = yy’ with y,y' € M \«. 


Since X is a code, we cannot haye eee Let us suppose that y’ ¢ X*. Then 
y' € P,A* and yy’ € X imply by Paroi y’ is in X*, a contradiction. 
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Let y € P,A* N A*S. Then for any u € A*, we have yuy € P,A*M A*S,. Thus Y is 
complete 

Finally, let us prove that Y has literal synchronization delay s. Let w be a word of 
length s. Let (u,v), (u’,v’) € Cm(w). Then w,u'w € P,A* and wv, wv’ € A*S,. Thus 


(u,v’), (w’,v) E€ Cm (w), showing that w is a constant. / 
EXAMPLE 3.7 Let A = {a,b} and X = {a,ab}. Then X isa ode with literal synchro- 
nization delay 1. The construction of the proof of Theorem gives Y = ab*. 


4 Exercises 


. section4bis.1 
Section 


1.1 Let X be a code with finite synchronization delay s. Show that 


X* =e6eUXU... UXT! U (X5 A* Nn A anne W (7.4.1) 


with W = {w € A* | A*wA* X* = Ø}. Show that W has also the expression 
W = A*V A* with 


V = (A* \ A* X5+1A*) \ (A* \ F(X8+2)) (7.4.2) 


1.2 A nonempty code X is complete and has finite synchronizing delay if and only if 
there is an integer s such that 


X° A* N ATX? CX". 


=ComplRatSynch g 
1.3 Show that the code Y of the proof of Theorem admıts the expression 


Y=XuU(T\W) (7.4.3) 


where T = (X*A* \ X**1A*) N (A* X5 \ A*X**") and W = A* X” A* U X*. 


. section4bis.2 
Section 


2.1 Let s > 1 be an integer and let ~, denote the equivalence on words of length at 
least s defined by y ~, z if y and z have the same prefix of length s, the same sufix 
of length s and the same set of factors of length s. A set Y C A* is said to be locally 
testable of order s if there is an integer s such that for two words y,z € A*A* with 
Y ~s z one has y € Y if and only if z € Y. Show that a set X is locally testable if and 
only if it is a finite boolean combination of strictly locally testable sets. 


2.2 The syntactic semigroup of a set Y C A* is the quotient of At by the syntactic 
congruence. Show that a set Y C A* is strictly locally testable if and only if all 


idempotents of its syntactic semigroup are constants (where a constant in the syntactic 
semigroup is the image of a constant in A*). 


2.3 Show that if Y is locally testable, then for each idempotent e in the syntactic 
semigroup of Y, the semigroup eSe is idempotent and commutative. 
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2.4 Show that a code X is locally parsable if and only if X* is locally testable. 


2.5 The transition matrix of an automaton A = (Q, E) is the Q x Q matrix with 
elements in the set P(A) of subsets of A defined for p,q E€ Q by 


Mp,q ={a€ A| there is an edge (p,a,q) in E}. 


Two automata A = (P, E) and B = (Q, F) are elementary equivalent if there exists 
a P x Q-matrix R and a Q x P-matrix S$ such that RS is the adjacency matrix of A 
and SR is the adjacency matrix of B. 

Show that if A and B are elementary equivalent automata such that all states have 
incoming and outgoing edges, then 

(i) they are simultaneously strongly connected, 
(ii) either R has entries in P(A) and S has 0,1 entries or conversely. 
(iii) they are equivalent, in the sense that the set of words which are the label of a 
path is the same for both automata. 


2.6 Let A be an automaton. An input merge equivalence is an equivalence on the 
set Q of states of A such that for any pair p,p’ of equivalent states, any letter a and 
any state q, one has p > q if and only if p' & q. Thus, in such an equivalence, two 
equivalent states have the same output. 

The quotient of A = (Q, E) by such an equivalence is the automaton A = (Q, E) with 
states the set Q of equivalence classes and edges the induced edges on the classes, i.e. 
(p,a,q) is in E if and only if (p,a,q) € E for some p,q € Q. Thus, in the quotient 
the output of a class of states is the common output of its elements and its input is 
the union, or merge, of the inputs of its elements (whence the name of an input merge 


equivalence). We say that A is obtained from A by an input merge, Elerai 
For example, let A and B be the automata represented on Figure E: T'he automaton 


Figure 7.8: An input merge. 


A is obtained from 6 by an input merge. 

An output merge equivalence is defined symmetrically. It is an equivalence on the set 
Q of states such that equivalent states have the same input. The quotient is defined in 
the same way by merging the outputs of the elements of a class. 

Show that if an automaton A = (P, E) is obtained from an automaton B = (Q, F) by 
an input (or output) merge, then A and B are elementary equivalent. 


2.7 Let A bea (£, r)-local automaton. Show that there is a sequence Ag, A1,...,Agiy 
of (¢,r)-local automata such that 


(i) Ap =A. 
(ii) For i =0,...,@—1, A; is obtained from A;;; by an input merge. 
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(iii) For i = 4,...,4 +r — 1, A; is obtained from A;+ı by an output merge. 
(iv) Arr is a subautomaton of the free (¢,7)-local automaton. 


2.8 Let A1, A2, B1 be strongly connected autmata such that A, is elementary equiva- 
lent to Ag and A, is a subautomaton of 51. We will build an automaton By elementary 
equivalent to 6, and such that A» is a subautomaton of B2. The automaton By is called 
the Nasu embedding of Az with respect to A1, B1. 

Set Ar = (Pi, E1), Ao = (Po, E2) and Bı = (Qi, Fı). Let R,S be matrices such 
that RS and SR are respectively the adjacency matrices of A, and Ag. Let Q2 = 
P U (Fy Ey), We define a Qı x Qo-matrix R and a Qə x Q\-matrix S as follows. By 
Exercise either R or S has 0,1 entries. 

Suppose first that R has entries in P(A) and S has 0,1 entries. Then, for qı € Qı 
and q2 € Q2, we define 


Raa = Raa U {a E A | q2 € Fi \ Fi, 0(g2) = 1,1 (qe) = a} 


where o(q2) denotes the origin of q2 and l(q2) its label. Next, for q2 € Q2 and q € Qı, 


set , 
Se, if q E€ P,a ERP 


Sea = 1 if q2 € Fı \ Fy and e(q2) = qı 
0 otherwise 


where e(q2) denotes the end of q2. 
In the case where R has 0,1 entries and S entries in P(A), we define 


Rag fq E€ Pa € Po 
Recs = 1 if q2 € Fy \ Ey and o(q2) = q1 
0 otherwise 


and 


Soa = Saza U {a E Al q2 E€ Fi \ Æa, Llao) = a, e(g2) = q1} 


It is clear that in both cases, RS is the adjacency matrix of Bı. The automaton 
Bz = (Q2, F2) is defined by its adjacency matrix which is SR. 


igklemEquiv 
Let for example A, aa Ag pe the elementary equivalent automata of Figure ra 


represented on Figure with Continous lines. 


Figure 7.9: The Nasu embedding 


The automaton Bı represented on the left has one additional state and two edges 
labeled c represented with dashes. The Nasu embedding of A2 is represented on the 
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right. It has two additional states corresponding to the additional edges of 51. The 
matrices R and § are 


1 

1 

0 1t- 
1 0 
2.9 Let A; and A be automata such that A; is obtained from Ag by input (or 


output) merge. Show that if Az is a subautomaton of a local automaton Bz, then the 
Nasu embedding of A, with respect to Ag, B2 is a local automaton with the same order. 


2.10 Show that any local automaton is a subautomaton of a complete one with the 


same order. 


. section4.7 
Section 


3.1 Let X C A% be a semaphore code, let A = (Q,1,1) be a deterministic trim 
automaton recognizing X* and let y be the associated ost ES aSpow that any 


group G C y(A*) which meets y(F(X)) is cyclic (use Proposition : 


5 Notes 


The notion of synchronization delay was introduced in Golomb and Gordon] (L965). It 
was proved in (1998) that any rational finite synchronization delay is contained 
in a complete rational one. However, the definition of synchronisation delay used in 


(1998) differs from ours. The construction is basically the same, but does not. 4 4 
allow to preserve the delay. Exercise [-3[is also from (1998). Bxercise feet 
from (L975) (see also (2004). A set X C A* is called 
star-free if it can be obtained from the subsets of the alphabet by a finite number of set 
products and boolean operations (including the complement). Thus star-free sets are 
those regular sets which can be obtained without using the star operation. Examples of 
star-free sets are J, A* (the com lement g 0 : the singletons {a} for a € A and the ideals 
aA* or A*aA*. Formulas ( and show that, for a code X wjth, hounded 
synchronization delay if X is star-free, then X* is also star-free. Formula (ER tows 
that, , if X is star-free, then Y* and thus also Y are star-free. There is a deep link 
between codes with bounded synchronization delay and star-free sets which has been 


investigated in ((L975)) (see (2004) for a connection with 


first-order logic). i 
4bis.2.4 
The term ‘locally parsable’ is from McNaughton, and„Papert| (L971). Exercises pat = 
is from [De Luca and Restivd ({1980). Exercise 2.3[has a converse which is a difficult 
theorem due to Mcnaughton, Zalcstein, Bzrozowski and Simon (see ({L976) ). 


The notions of elementary equivalence of automata and of input or out yt. merge 
are concepts of symbolic dynamics (see Lind and Marcus| ({L995)). Exercise escribes 


a construction due to Nasu (see Lind and Margus ([L995) p. 354, Lemma 10.2.3) and 
called the masking lemma . Exercise2.10/is a result of (2007) 
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spad 
The theorem of the synchronization of semaphore codes (Theorem 2 i) is In|Schtitzenberger 


(1964). This paper contains also a difficult combinatorial proof of this result. 
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Groups of codes 


0 Introduction 


ter4 
We have seen in Chapter Wifthat there is a permutation group G(X) of degree d(X) 
associated with every thin maximal code X which we called the group and the degree of 
the code. We have seen that a code has a trivial group if and only if it is synchronized. 

In this chapter we study the relations between a code and its group. As an example, 
we will see that an indeco ogable prefix code X has a permutation group G(X) which 
is primitive (Proposition eae will also see that a thin maximal prefix code X has 
a regular group iff X = y oVoV oW with U,V synchronous and V a regular group 
code (Proposition Fay 

We study in more detail the gro sof pifjx codes. We start with the implest class, 
namely the group codes in Section pres, in particular (Theorem at a finite 
group code is uniform. 

Tn, the next two sections, we again examine the techniques introduced in Chapter 
Mi and particularize them to bifix codes. Specifically, we shall see that bifix codes 
are charac grized, by the algebraic property of their,s gtic monoids being nil-simple 
(Theorem -Thie proof makes use ,of Theorem ' i 7 concerning codes with bounded 
deciphering d ay. Section is devoted to groups of finite bifix codes. The main result 
is Theorem stating that the group of a finite, indecomposable, nonuniform maximal 
bifix code is doubly transitive. For the proof of this theorem, we use difficult results 
from the theory of permutation groups without proof. This is the only place in the 
book where nonelementary knowledge is required. The last section contains a series of 
examples of finite bifix codes with special groups. 


1 Groups and composition 

We now examine the behavior of the group of a code under composition. ais 
Let G be a transitive permutation group over a set Q. Recall (cf. Sect. Taj that an 

imprimitivity equivalence of G is an equivalence relation 0 on Q stable with respect to 

the action of G, i.e., such that for all g € G, 


p = q mod 0 = pg = qg mod 0 
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The action of G on the classes of 6 defines a transitive permutation group denoted by 
Go and called the imprimitivity quotient of G for 0. 
For any q € Q, the restriction to the class mod 0 of q of the subgroup 


K = {k E€ G | qk = q mod 6} 


formed of the elements globally stabilizing the class of q mod ð is a transitive permuta- 


tion group. D feran" guduced by G on the equivalence classes mod @ are all equivalent 


(see Section D9. Any one of these groups is called the group induced by G. It is denoted 
by G°. 

Let d = Card(Q) be the degree of G, let e be the cardinality of a class of 0 (thus e 
is the degree of G? ), and let f be the number of classes of 6 (i.e., the degree of Go). 
Then we have the formula 


d= ef. 


EXAMPLE 1.1 The permutation group over the set {1, 2,3, 4, 5,6} generated by the two 
permutations 


a = (123456), 8B = (26)(35) 


is the group of symmetries of the hexagon, 


AS 
6 2 


| | 
5 S 3 
} a 
It is known under the name of dihedral group Dg, and has of course degree 6. It admits 
the imprimitivity partition 
{1,4}, {2,5}, {3,6} 
The groups Gg and G? are, respectively, equivalent to G3 and Z/2Z. 
PROPOSITION 1.1 Let X be a very thin code which decomposes into 
X=Yo0oZ 


with Y a complete code. There exists an imprimitivity equivalence 0 of G = G(X) such 
that 
G? =G(Y), Go = G(Z). 


In particular, we have 
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Proof. Set X = Y og Z with B = alph(Y) and ĝ a bijection from B onto Z. Let 
P and S be the sets of states of the flower automata A} (X), A(Z), respectively. 
Let » (resp, Ja he the morphism associated to A} (X) (resp., Ap(Z)). In view of 
Proposition 5, and since Y is complete, there exists a reduction p : P > S. Actually, 
for (u,v) € P — (1,1) we have p(u,v) = (r,s) where u = zr and v = sz with z,z € Z* 
and (r,s) € S. 

Moreover, A (Y) can be identified through @ with the restriction of A7,(X) to the 
states which are in Z* x Z*. As usual, we denote by ô the morphism from M = y(A*) 
onto M’ = (A*) induced by p. Thus Y = fo y. 

Let J (resp., K) be the 0-minimal ideal of M (resp., of M’). Then J c #-1(K), since 
p—'(K) is a nonnull ideal. Thus (J) C K. Since (J) 4 0, we have 


pI) =K. 


Let e be an idempotent in J N y(X*), let R = Fix(e) C P and let G = Ge. Let 
us verify that p is a surjective function from R onto Fix(f(e)). Let indeed s be a 
fixpoint of f = (e). By definition of a reduction, there exist p,q € P such that 
p(p) = plq) = s and (p,e,q) = 1. Since e is idempotent, there exists a fixpoint r of e 
such that (p,e,r) = (r,e,q) = 1. Then p(r) = s by unambiguity, proving the assertion. 
Further, the nuclear equivalence of the restriction of p to R defines an equivalence 
relation 0 on R which is an imprimitivity equivalence of G. Indeed, let r,r’ € R be 
such that p(r) = p(r’). Let g € G and set s = rg, s’ = r'g. By definition of G there 
is an m € M such that g is the restriction to R of eme. Then, since p(eme) is a 
permutation on p(R), we have p(s) = p(s’), proving the assertion. The group Ge) is 
the corresponding imprimitivity quotient Gg. This shows that G(Z) is equivalent to 
Go. 

Let T = {(u,v) € P | u,v € Z*}. Then T can be identified with the states of the 
flower automaton of Y and moreover T = p~'(1,1). Let L be the restriction to T of 
the submonoid N = y(Z*) of M. Then 


eNe = H(e) NN. 


Indeed, one has eNe C H(e) MN since e € y(X*) and X* C Z*. Conversely, if 
n € H(e)NN, then n = ene and thus n € eNe. Since H(e) N N is a group, this 
shows that eN is a minimal right ideal and Ne is a minimal left ideal. Thus e is in the 
minimal ideal of the monoid N. Moreover the restriction to RN T of H(e) A N is the 
Suschkewitch group of L. 

Thus the restriction to RAT of the group H (e)N L is equivalent to the group G(Y). 
On the other hand, since T = p~!(1,1), this group is also the group G° induced by G 
on the classes of 8. - 


EXAMPLE 1.2 Let X = Z” where Z is a very thin code and n > 1. Then 
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N BR 
Ww oy bd 
me BB] 
ov oy e 
mH CO] Or 
NI] OD 
œo CO] NI 
= =| 00 


a 
b 
Table 8.1: The next state function of A(X*) 


EXAMPLE 1.3 Consider the maximal prefix code 
Z =(4 -b UDA? 


over A = {a,b}. Set 


X=27. 
4.1 
The automaton A(X*) is given in Table ST) Let y be the corr sponding representation. 
The monoid y(A*) is the monoid of functions of Example Bat on setting y(a) = u, 
p(b) =v. 
The idempgtent e = (af) has minimal rank since the action of A on the R-class of e 
given in Fig. is complete. Consequently, the group G(X) is the dihedral group D4. 


This group admits an imprimitivity partition with a quotient and an induced group 
both equivalent to Z/2Z. This corresponds to the fact that 


G(Z) = Z/2Z, 


since 
Z=To A’, 


where T is a synchronized code. 


In the case of prefix codes, we can cages the study of the influence of the decomposi- 
tions of the prefix code on the structur gup. We use the maximal decomposition 
of prefix codes defined in Proposition 


PROPOSITION 1.2 Let X be a very thin prefix code, and let 
X=Yo0Z 


be its maximal decomposition. Then Z is synchronized, and thus G(X) = G(Y). 


Proof. Let D = X*(A*)~!, U = {u € A* | u-!D = D}. Then Z* =U. Let y be the 
morphism associated with the automaton A(X*). Let J be the 0-minimal ideal of the 
monoid y(A*). 

Consider x € X* such that y(x) € J. First we show that 


D = {w € A* | (xw) #0} (8.1.1) 


Indeed, if w € D, then zw € D and thus y(xw) # 0. ari if p(aw) 4 0 for some 
w € A*, then the fact that the right ideal aes by ae is 0-minimal implies that 
there exists a word w’ € A* such that y(x) = y(xww’) Wee zpw’ E€ X*. By right 
unitarity, we have ww’ € X*, whence w € D. This proves ( 
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w € Da-!. Then wz € D. r (xwr) #0. Using now the 0-minimality of the 
left ideal generayed by p(x), there exists a word w’ € A* such that y(w'rwr) = y(x). 
Using again ( we = or, a 1 jw" E€ D, 0 4 y(cw") = v(w'zwew'). Then also 
yp(zwrw") £0 and again by eure € D. This shows that D C (wx)~!D. For 
the reverse inclusion, let w” € (wx)~!D. Then wzw” € D. Thus y(xwrw") 4 0. This 
implies that y(aw”) 4 0, whence w” € D. Thus D = (wx)~!D, showing that wa € U, 
hence w € Ut. 

Now we prove that (2,2) is a synchronizing pair for Z. Let w, w’ € A* be such that 
wrrw € Z* = U. Since U C D, we have wrxrw’ € D and thus wx € D. By the 
equality Dx~' = Ux !, this implies wx € U. Si ce U Z is right unitary, cw’ also is in U. 
Thus Z is edhon. In view of ae 1s concludes the proof. n 


Next Dx“! = Ux™!. Indeed D i y implies Dx~' > Ux™t. Conversely, consider 


We now prove a converse of Proposition tT the case of prefix codes. 


PROPOSITION 1.3 Let X be a thin maximal prefix code. If the group G = G(X) admits 
an imprimitivity equivalence 0, then there exists a decomposition of X into 


X=YoZg 
such that G(Y) = G? and G(Z) = Go. 


Proof. Let p be the representation associated with the minimal automaton A(X*) = 
(Q,1,1), and set M = y(A*). Let J be the minimal ideal of M, let e € JN y(X*) 
be an idempotent, let L be the £-class of e and I be the set of H-classes of L. Let 
S = Fix(e). We have G(X) = Ge. 

Since X is complete, each H €T is a group and therefore has an idempotent ey with 
Im(e) = Im(ep) and thus on Be Fix(e). The code X being prefix, ey is in p(X*) 
for all H € T, by Proposition 

By assumption, there exists an equivalence relation 0 on S that is an imprimitivity 
equivalence of the group Ge. Consider the equivalence relation 6 on the set Q of states 
of A(X*) defined by 


p = q mod 6 if and only if VH ET, pep = qep mod 6. 
Let us verify that ĝ is stable, i.e., that 
p=qmod Ê => p- w= q: w mod Ô 
for w € A*. Indeed, let m = y(w). Note that for H ET, 
MEH = €mHMeH = eEmHEMEH (8.1.2) 


since €mHe = emH. Obserye also that emey € H(e) since en € H(e) fralne L 
and since mep €E L by ( ~ Assume now that p = q mod 0. Then by definition 
pemH = Wemy mod 0 and 0 being an imprimitivity equivalence, this implies 


pemHemey = qemHemeH mod 0 


5 7 
By (ra it follows that pmey = qmey mod 0 for all H € T. Thus p-w = q: w mod 0. 
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Moreover, the restriction of Ê to the set S = Fix(e) is equal to 0. Assume indeed 
that p = q mod 6 for some p,q E€ S. Then pe = qe mod 0 . Since p = pe and q = ge, 
it follows that p = q mod 0 . Conversely, if p = q mod 0 , then for all H €T, peg = p 
and qep = q, because of the equality Fix(ey) = S. Consequently p = q mod 6. 

Consider the prefix code Z defined by the right unitary submonoid 


Z* = {z € A*|1-z=1mod 9} 


Then clearly X eed the automaton A(X") being trim, alphz(X) = Z. Thus, 
by Proposition [I[7.9, X decomposes over Z: X = Y o Z. The automaton Ag defined 
by the action of A* on the classes of 6 recognizes Z* since Z* is the stabilizer of the 
class of 1 modulo 6. The group G(Z) is the group Gg. The automaton obtained by 
considering the action of Z on the class of 1 mod 6 can be identified with an automaton 
recognizing Y*, and its group is G®. n 


COROLLARY 1.4 Let X be a thin maximal prefix code. If X is indecomposable, then 
the group G(X) is primitive. 


EXAMPLE 1.4 We pey once more the finite maximal prefix code X = ee b?) U 


b? A2)? of Example [I.3} with the minimal automaton of X* given in Fig. | Let y 
be the associated representation. We have seen that e = (at) is an idempotent of 
minimal rank. The group Ge = G(X) is the dihedral group D4 generated by the 


Figure 8.1: The minimal automaton of X* 4_29 
permutations 
(1458), (15) 


The partition Ô = {{1,5},{4,8}} is an imprimitivity partition of Ge. ð 
The £-class of e is composed of four H-classes. They are represented in Fig. Ba 
together with the associated nuclear equivalences. 
The equivalence ĝ is 
6 = {{1,3,5, 7}, {2, 4, 6, 8}} 


The stabilizer of the class of 1 mod Ê is the uniform code Z = A? with group Z/2Z. 
We have already seen that 


X=(T0o LY aT ot 
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a 
e o > {1,3}, {2,4}, {5,7}, {6,8}, 
e1 i p {1,3}, {2,8}, {4,6}, {5,7}, 
a 
e2 {1,7}, {2,8}, {3,5}, {4,6}, 
2 a 
è a {1,7}, {2,4}, {3,5}, {6,8}, 


Figure 8.2: The £-class of e = y(a*). 


Table 8.2: The Automaton of ((a U bA?)*)* 


12345 6 7 8 9 10 ll 12 
a 43476 7 0 9 0 I 2 1 
b2 3 4 5 67 8 9 10 ll 1 


Table 8.3: The Idempotent e = y(a*) 
1 2 3 4 5 6 7 8 9 10 11 12 


a COCO ia 


for some synchronized code T. The decomposition of X into X = 7°82 is that obtained 
by applying to X the method used in the proof of Proposition oa 


EXAMPLE 1.5 Let Z be the finite complete prefix code over A = {a,b}, 
Z =aUbA?. 


and consider 
X= 2Z'. 


The automaton A(X*) is given in Table Ba Tet y be the representation associated 
with A(X*). The element e = (at) is easily seen to f an idempotent of minimal 
rank 4, with Fix(e) = {1,4,7,10}. It is given > Table SEN minimal ideal of y(A*) 
reduces to the R-class of e, and we have G(X Z/ 4Z, as a result of computing the 
L-representation with respect to e given in Fig. ‘eat The partition 0 = {{1,7}, {4,10}} 
is an imprimitivity partition of Ge The corresponding equivalence Ê is 


a/(1, 4, 7, 10) b/(1, 10, 7, 4) 


C œx _ D 


Figure 8.3: The £-representation with respect to e. 


6 = {{1,3,5,7,9, 11}, {2, 4,6,8, 10, 12}}. 
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The stabilizer of the class of 1 mod 6 is the uniform code A’, and we have 
KEY, 


Observe that we started with X = Z*. In fact, the words in Z all have odd length, and 
consequently 
Z =Yo0 A? 


for some Y. Thus X has the two decompositions 


X =Z =Y?’o0 A. 


2 Synchronization of semaphores 
apter2 
In this section, we prove the result announced in Chapter i j namely, 


THEOREM 2.1 Let X be a semaphore code. There exist a synchronized semaphore code 
Z and an integer d > 1 such that 


cae 
In view of Proposition d is of course the degree d(X) of the code 
X. Observe that, by Proposition ane Corollary a semaphore code is a thin 
maximal code and thus its degree d(x ) and its group G(X) are well defined. 


The proof of the theorem is in yar parts. We first examine the group of a 
semaphore code. However, the following proposition is just an intermediate step, since 
the theorem implies a stronger property. 


We recall that a transitive permutation group over a set is a, regular if its 


elements, with the exception of the identity, fix no point (See Sect. 


LEMMA 2.2 The group of a semaphore code is regular. 


Proof. Let X C A* be a semaphore code, let P = XA” be the set of proper prefixes 
of words in X, and let A = (P,1,1) be the literal automaton of X*. Let y be the 
representation associated with A, and set M = y(A* 

A semaphore code is thin (by Proposition WI and complete. Thus M has a 
nonzero minimal ideal, 0 ¢ M and the ideal y(F'(X)) of images of words which ar hot 
factors of words in X contains the minimal ideal, say K, of M. By Propostion he 
Suschkevitch group of y(A*) is equivalent to G(X). 

Let e be an idempotent in y(X*) N K, and let R = Fix(e). These fixed points are 
words in P. They are totally ordered by their length. Indeed let w be in y~!(e)N F(X). 
Then we have r-w =r for all r € R. Since w is not a factor of a word in X, no rw is 
in P. Thus each word r € R is a suffix of w. Thus, for two fixed points of e, one is a 
suffix of the other. 


4 
Next, we recall that, by Corollary reat 


PX C X(PUX). 
By induction, this implies that for n > 1, 


PX” c X"(PU X). (8.2.1) 
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To show that Ge is regular, we verify that each g € H(e) N y(X*) increases the length, 
i.e., for r,s € R, 

Ir] < |s| = |rg| < |sg| (8.2.2) 
This implies that g is the identity on R since the above property cannot be satisfied if 
g has a nontrivial cycle. Since H (e)N y(X*) is composed of the elements of H (e) fixing 
1, this means that only the identity of Ge fixes 1. Since Ge is transitive, this implies 
that Ge is regular. 


2 
For the proof of , let g € H(e) N p(X*), and let x € 7t(g). Then x € X” for 
some n > 0. Let r,s € R with |r| < |s|. Then by ( ; 


rz = yu, S£ = ZU with y,z E€ X”, u,v€ PUX. 


2 
The word u is a suffix of v since otherwise z € A*yA* (see Fig. Ei which implies 
X” N A*X” At Æ Ú, contradicting the fact that X” is semaphore. Further, we have in 
A 


rg=uor 1 according touc Poru€e X, 


sg=vor 1 according tov€ Porve X. 


Since g is a permutation on R and lg = 1 and s £1, we have sg # 1. Thus sg = v. 


r X 
ò y u ò 
; z , ; 


Figure 8.4: Comparison of ra and sa. 


Since r Æ s, we have rg 4 sg. Since u is a suffix of v, we have |rg| < |sg| both in the 
two cases rg = u and rg = 1. n 


Now let X C AT be a group code. Then by definition, 
X* =a 7!(H) 


where a : A* — G is a surjective morphism onto a group G and H is a subgroup of G. 
The code X is called a regular group code if 


Hi 


Then the permutation group G(X) is the representation of G by multiplication on the 
right over itself. It is a regular group. zi 
The following proposition is useful for the proof of Theorem However, it is 


interesting in itself, because it describes the prefix codes having a regular group. 


PROPOSITION 2.3 Let X be a thin maximal prefix code. Then the group G(X) is regular 
if and only if 
X =UoVowW, 


where V is a regular group code and U,W are synchronized codes. 
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-6.6 
Proof. The condition is sufficient. Indeed, if X = U o V o W, then by Proposition f i 


we have G(X) = G(V). 

Conversely, let A = (Q,1,1) be an unambiguous trim automaton recognizing X*, let 
y be the associated representation and M = y(A*). Since X is thin and complete, the 
minimal ideal J of M is a union of groups. 

Consider an idempotent e E€ y(X*)N J, let G = H(e) be its H-class, L its L-class 
and let I be the set of H-classes contained in L. Each of them is a group, and the 
idempotent of H will be denoted by ey. The set of pairs 


{(ex,e) | H ET} 


is a system of coordinates of L relative to e. Indeed eye € H. Moreover, since e € Mey, 
eey = e and thus eepe = e. Let us ee, corresponding £-representation of M. 


For this choice of coordinates, by ( , we have for m € M and H ET, 

m x H = rmeyl (8.2.3) 
where e = lr is the column-row decomposition of e. Indeed, we have in this case 
r™H — re =r and ly = eyl. 

Set 


N={neM|VH €T,n* H =n*xG}. 
The set N is composed of those elements n € M for which the mapping 


HeTrent«eHeG 


Toi 
is constant. It is a right-unitary submonoid of M. Indeed, first 1 € N by (Bt and 
if n,n’ € N, then 


nn! x H 


(n«n'H)(n' x H) (8.2.4) 
(saji +0) 


whic. is independent of H. Thus nn’ € N. Assume now that n,nn’ € N. Then by 
and since n x n'H) and n » G have an inverse in Ge 


nl x H = (nx n' HY (nn * H) = (n x G) t (nn *G) 
which is independent of H, showing that n’ € N. Thus 
p(n) =W" 


for some prefix code W. 
The hypot iş that G(X) is regular implies that X* c W*. Indeed, let m € y(X*). 
Then by ( we have for H €T, 


mx H = rmepc. 


.5.8 
Since X is prefix, ey € y(X*) by Proposition feat Consequently m x H fixes the state 
1 € Q (since r,m,ey and l do). Since G(X) is regular, m x» H is the identity for all 
H eT. This shows that m € N. 
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We now consider the function 
0 : W* — Ge 


4 
which associates to each w € W* the permutation y(w)*G. By Bii is a morphism. 
Moreover, 0 is surjective: if g € G, then 


g * G = rgel = rgl 


which is the element of Ge associated to g. From gx H = rgeyl = r(ge)ex(el) = rgel = 
rgl, it follows that g € N. 

For all x € X*, since y(x) x G = 1, we have (x) = 1. a6 

Since X* Cc W* and X is a maximal code, we have by Proposition fit 


X=Y oW, 
where 3 : B* — A* is some injective morphism, 6(B) = W and 6(Y) = X. Set 
a=0o0ß. 


Then a : B* — Ge is a morphism and Y* C a7!(1) since for all x € X*, we have 
(x)= 1. Let V be the regular group code defined by 


Vee Om. 
Then Y = U o V and consequently 
X =UoVow. 


By construction, G(V) = Ge. Thus G(X) = G(V). The codes U and W_a e,syn- 
chronized. Indeed d(X) = d(V) and d(X) = d(U)d(V)d(W) by Proposition BE 
d(U) = d(W) = 1. This concludes the proof. n 


7.1 
The following result is the final lemma needed for the proof of Theorem p: i 


LEMMA 2.4 Let Y C Bt be a semaphore code, and let V 4 B be a regular group code. 
If 
Y* c V*, 


then 
Y = (C* D)’ 


for some integer d, where C = BAV and D = B—C. Moreover, C*D is synchronized. 


Proof. Let a : B* — G be a morphism onto a group G such that V* = a71!(1). Since 
V Æ B, we have G F {1}. We have 


C={be Bla(b)=1} adD={beEB]ath#i} 


The set D is nonempty. We claim that for y € Y, |y|p > 0. Assume the contrary, 
and let y € Y be such that |y|p = 0. Let b € D. Then a(bu) 4 1 for each prefix u 
of y since a(u) = 1. Thus no prefix of by is in V, whence in Y. On the other hand, 
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B*Y C Y B* because Y is semaphore (Proposition ip 3) This gives the contradiction 
and proves the claim. 

Set T = C*D. Let d be the minimum value of |y|p for y € Y. We will show that for 
any t = tytg---tg, with t; € T and y € Y such that |y|p = d, there is a word y in Y 
such that y == tv and v is a prefix of y. 

Indeed, since Y is a semaphore code, tay € Y B*. Thus 


tay = Y1W1 


for some yı € Y, wı € B*. We have |y;|p > d by the minimality of d and |yi|p < d+1 
since |yi|p < |taylp =d+ 1. If |yi|p =d +1, then wı € C* and thus 


a(yi) = a(y1w1) = a(ta) A 1, 


a contradiction. 
Thus, |y1|p = d, |wi|p = 1. In the same way, we get 


td-1Y1 = Y2W2,---,t1Yd-1 = YdWa; 


where each of the yo,..., ya satisfies |y;|p = d each wo,...,wg is in C*DC%. 
Composing these equalities, we obtain (see Fig. 


ty = tte... tay = YdWdWd-1°°* W1- (8.2.5) 
Since yg € (C*D)4C* and t € (C*D)4, we have 
Ya = tıit2 -tqv E Y (8.2.6) 


for some v € C* which is also a prefix of y. This proves the claim. 

This property holds in particular if tı € D, showing that Y contains a word x ( = ya) 
with d letters in D and starting with a letter in D, i.e., x € (DC*)2. Thus z is one of 
the words in Y for which |æ|p is minimal. Substi qv for y in EEEE Thon starting 
with any word t = tıt2:-- ta € T?, we obtain e i v = 1, since v is in C* and 
is a prefix of x. This shows that t € Y. Thus T C Y. Since T? is a maximal code, we 


have T¢ = Y. Since B*b C T* for b € D, the code T is synchronized. n 
aloi -7.2 
Proof of Theorem FF Tt X be emaphore code. By Lemma Beatie group G(X) 
is regular. In view of Proposition p we have 
X=UoVowW, 


where V is a regular group code and U and W are synchronized. Set Y = U o V. If 
d(V) = 1, the ry is synchronized and there is nothing to prove. Otherwise, according 
to Lemma 22-4, there exists a synchronized code T such that Y = T¢. Thus 


X=TtoW = (To W). 


The code Z = T o W is synchronized because, T. and W are. Finally, since X = Z7 is 
semaphore, Z is semaphore by Corollary eP is proves the theorem. n 
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Figure 8.5: 4_33 


Figure 8.6: The automaton A(X*) and its table. 4_34 


EXAMPLE 2.1 Let Z be the semaphore code Z = {a, ba, bb} over A = {a,b}. This code 
is synchronized since A*a C Z*. Set 


X=7? 


4 
The minimal automaton A(X*) is given by Fig. Ba Let y be the associated rep- 
resentation and M = y(A*). The element e = (a?) is an idempotent of minimal 
rank 2 = d(X). Its L-class is composed of two groups 3 = H(e) and Gp. The 


£-representation of M with respect to e is given in Fig. with the notation a in- 


b/(1) 


b/(13) 


Figure 8.7: The £-representation of M. 4_35 


7.3 
stead of y(a). The prefix code W of Proposition ES is W = Z. Indeed, we have 
a*1=ax*2= (13); bax 1 = ba x 2 = (13); bb x 1 = bb* 2 = (13). In this case, the code 
U is trivial. 
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EXAMPLE 2.2 Consider, over A = {a,b}, the synchr ized semaphore code Z = a*b. 
Let X = Z?. The automaton A(X*) is given in Fig. K2. 


4_36 


Figure 8.8: The automaton of X* = [(a*b)?]* 


Let y be the associated representation. The element e = (b?) is an idempotent. Its 
set of fixed points is {1,3}. The £-class of ç is reduced to the group H(e), and the 
monoid N of the proof of Proposition E irefore is the whole monoid y(A*). Thus 
W = A. The morphism a from A* into Ge is given by 


a(a) = id{1 3}, a(b) = (13). 


We have 
X=UoV 
with 
V = a U ba*b. 
This example illustrates the fact that even when X is a semaphore cod he gode U in 
the statement of Proposition pas be non trivial and that Lemma Is needed to 


obtain the decomposition X = Z?. 


3 Group Codes 


Let us first recall the definition of a group code. Let G be a group, H a subgroup of G. 
Let y : A* — G be a surjective morphism. Then the submonoid y~!(H) is biunitary. 
It is generated by a bifix code called a group code. =e l 

A group code iş 2 maximal code (see Section op tt is thin iff it is recognizable 
(Example jE 3 or equivalently, if the index of H in G is finite. 

Rather than define a group code by an “abstract” group, it is frequently convenient 
to use a permutation group. This is always possible for a group code X by considering 
the minimal automaton of X*. We give here the detailed description of the relation m 
between the initial pair (G, H) and the minimal automaton of X* (see also Sect. ‘Papen 
Let G be a group and H a subgroup of G. Let Q be the set of the right cosets of H 
in G, i.e., the set of subsets of the form Hg, for g € G. To each element g in G, we 
associate a permutation 7(g) of Q as follows: for p = Hk, we define 


pm(g) = H(kg). 


It is easily verified that m is well defined and that it is a morphism from the group 
G into the symmetric group over Q. The subgroup H is composed of the elements of 
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G whose image by m fixes the coset H. The index of H in G is equal to Card(Q). In 
particular H has finite index in G iff 7(G) is a finite group. 
Now let y : A* — G be a surjective morphism. Let X be the code generating 
X* = y7} (H).For all u,v € A*, 
Holu) = Hy(v) & ut X* =u lx". 


Indeed, set g = y(u), k = y(v). Then Hg = Hk iff gH = k“'A (since (Hg)! = 


g 1H). Further u-lX* ="! ya) v 1X =y-1(k-!H). This proves the formula. 
According to Example [IVB], we have the equality 
Card(Q) = d( X). (8.3.1) 


THEOREM 3.1 Let X C At be a group code. If X is finite, then X = A? for some 
integer d. 


Proof. Let A = (Q,1,1) be the minimal automaton of X*, and let e, the associated 
representation. Let d be the degree of X. Then d = Card(Q) by ( ees 

Consider the relation on Q defined as follows: for p,q € Q, we have p < q iff p=q 
or q # 1 and there exists a simple path from p to q in A. Thus p < q iff p = q, or there 
exists a word w € A* such that both p-w = q and p- u Æ 1 for each left factor u Æ 1 
of w. This relation is reflexive and transitive. 

If X is finite, then the relation < is an order on Q. Assume indeed that p < q and 
q < p. Then either p = 1 and q = 1 or both p Æ 1, q #1. In the second case, there 


exist simple paths p —> q and q —> p. There are also simple paths 
1 => p, p = 1. 


Thus for all i > 0, the paths 


p—l 
are simple, showing that u(ww’)*v C X. Since X is finite, this implies ww’ = 1, whence 
= T hus < is an order. Now let a,b € A be two letters. According to Proposition 
Papes d have 
af, bt E X. 
Thus none of the states 1-a, 1-b for 1 < i < d is equal to 1. Consequently, 


1<1l-a<1l-a? eee 21g See Sle 


and l 
1<1:b<1b <. Sie <. Seis. 


Since Q has d states, this implies that 1-a’ = 1.b for alli > 0 . Thus y(a) = (b) for 
all a,b € A. We get that for all w € A* of length n, we have w € X* iff a” € X*, i.e., 
iff n is a multiple of d. This shows that X = Af. n 


The following theorem gives a sufficient condition, concerning the group G(X), for a 
bifix code to be a group code. It will be useful later, in Section bf 
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THEOREM 3.2 Let X be a thin maximal bifiz code. If the group G(X) is regular, then 
X is a group code. 


4.3 
Proof. According to Proposition ES there exist two synchronous codes U, W and 
a group code V such that 
X =UoVow. 


Since X is thin maximal bifix, so are U and W (Proposition (ilr. 9): Bipce U and W are 
synchronous, they are reduced to their alphabets (Example M62). 


this gives the result. " 


THEOREM 3.3 Let X C At be a code with A = alph( X). The two following conditions 
are equivalent. 

(i) X is a regular group code, 

(ii) X* is closed under conjugacy, i.e., uv E X* => vu € X*. 


Proof. If X is a regular group code, the syntactic monoid of X* is a group G = y(A*) 
and X* =p71(1). If uv € X*, then y(u)y(v) = 1, hence also y(v)y(u) = 1, showing 
that vu is in X*. 

To show the other implication, let us first show that X is bifix. Let u,v € A* be such 
that u,uv € X*. Then also vu € X*. Since X* is stable, it follows that v € X*. Thus, 
X* is right unitary. The proof for left unitarity is analogous. Now let M = y(A*) 
be the syntactic monoid of X*. We verify that y(X*) = 1. For x € X*, we have the 
equivalences 

uzv E€ X* & rvu E€ X* & vu E€ X* SuveE X*. 


Thus, y(x) = (1). Since (1) = 1, it follows that y(X*) = 1. 

Finally, we show that M is a group. From A = alph(X) , for each letter a € A, there 
exists x € X of the form x = uav. Then avu € X*, whence y(a)p(vu) = 1. This shows 
that all elements y(a), for a € A, are invertible. This implies that M is a group. 7 


COROLLARY 3.4 Let X C At be a finite code with A = alph(X). If X* is closed under 
conjugacy, then X = A? for some d > 1. 


4 Automata of bifix codes 


The general theory of unambiguous monoids of relations takes a nice form in the case of 
bifix codes, since the automata satisfy some additional properties. Thus, the property 
to be bifix can be “read” on the automaton. 


PROPOSITION 4.1 Let X be a thin maximal prefix code over A, and let A = (Q,1,1) 
be a deterministic trim automaton recognizing X*. The following conditions are equiv- 
alent. 

(i) X is maximal bifix, 

(ii) for all w € A*, we have 1 € Q- w, 

(iii) for all w € A*, q-w=1.w implies q=1. 
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Proof. In a first step, we show that 
(it) = X is left complete. (8.4.1) 


If (ii) is satisfied, consider a word w, and let q E€ Q be a state such that q- w = 1. 
Choose u € A* satisfying 1-u = q. Then 1-uw = 1, whence uw € X*. This shows 
that X is left complete. Conversely, assume X left complete. Let w € A*. Then there 
exists u € A* such that uw € X*. Thus, 1 = 1 - uw = (1 - u) - w shows that 1 € Q- w. 
Next, we prove that 
(iii) = X*is left unitary. (8.4.2) 


Assume that (iii) holds and let u,v € At be such that u,vu € X*. Set g=1-v. Then 
l-w=landq-u=(1-v)-u=1lwu=1. Thus, g-u=1-u, whence q = 1. This shows 
that 1-v = 1 and consequently v € X*. Assume conversely that X* is left unitary and 
let w be such that q -w = 1-w = p for some p,q E€ Q. Let u,v € A* be such that 
1-w=q,p:v=1. Then 

1- uwv = 1 = 1. wv. 


Thus, uwv, wv € yd by unitarity x *. Conse yently, 1-u=1=dq, proving 
the equivalence ee vie gt, ae ESS the proposition is a direct 
consequence of ER ape m 

Let X be a thin maximal bifix code, and let A = (Q,1,1) be a trim deterministic 
automaton recognizing X*. Then the automaton A is complete, and the monoid M = 
pa(A*) is a monoid of (total) functions. The minimal ideal J is composed of the 
functions m such that Card(im(m)) = rank(m) equals the minimal rank r(M) of M. 
The H-classes of J are eae Ph the minimal images and by the maximal nuclear 
equivalences (Proposition ach state appears in at least one minimal image and 
the state 1 is in all minimal images. Each H-class H meets y(X*) and the intersection 
is a subgroup of H. Note the following important fact: If S is a minimal image and w 
is any word, then T = S- w is again a minimal image. Thus, Card(S) = Card(T) and 
consequently w realizes a bijection from S onto T. 

In the sequel, we wi i oo in the minimal automaton A(X*) of X*. Ac- 
cording to Proposition is automaton is complete and has a unique final state 
coinciding with the initial state. Thus, A(X*) is of the form considered above. 

Lety be the representation associated with the minimal automaton A(X*) = (Q,1,1), 
and let M = y(A*). Let J be the minimal ideal of M. We define 


E(X) = ¢"*(J). 
This is an ideal in A*. Moreover, we have 


for all minimal images S and 


we E(X) & f T of Awe have S-w=T-w. 


(8.4.3) 


Indeed, let w € E(X). Then U = Q-w is a minimal image. For any minimal image 
T, we have T-w C Q-w = U, hence T -w = U since T -w is minimal. Thus, 
T-w=S-w=Q-w. Conversely, assume that for w € A*, we have S- w = T -w for 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


350 VIII. Groups of codes 


all minimal images S,T. Set U equal to this common image. Since every state in Q 
appears in at least one minimal image, we have 


a 


where the union is over the minimal images. This show, hat (w) has minimal rank, 
and consequently w € E(X). Thus, the equivalence ( ris proved. 


PROPOSITION 4.2 Let X be a thin maximal bifiz code and let A(X*) = (Q,1,1) be the 
minimal automaton of X*. Let p,q E€ Q be two states. Ifp-h=q-h for allh € E(X), 
then p =q. 


Proof. It suffices to prove that for all w € A*, p- w = 1 iff q: w = 1. The conclusion, 
namely that p = q, follows then by the definition of A(X *). 

Let h € E(X) NX*. Let w € A* be such that p-w = 1. We must show that q-w = 1. 
We have p -wh = (p-w)-h =1-h = 1, since h € X*. Now wh € E(X), hence by 
assumption q: wh = p- wh = 1. Thus, (q:w):-h = 1. By Proposition ii ge it follows 
that q-w=1. This proves the proposition. n 

For a transitive permutation group G of degree d it is customary to consider the 
number k(G) which is the maximum number of fixed points of an element of G distinct 
from the identity. We call minimal degree of G the number 6 = d — k(G). The group 
is regular iff k(G) = 0, it is a Frobenius group if k(G) = 1. 

If X is a code of degree d and with group G(X), we denote by k(X) the integer 
k(G(X)). We will prove 


THEOREM 4.3 Let X C A* be a thin maximal bifix code of degree d,and let k = k(X). 
Then 
A" — AX A* C E(X) 


LEMMA 4.4 With the above notation, let A = (Q,1,1) be the minimal automaton rec- 
ognizing X*. For any two distinct minimal images S and T of A, we have 


Card(S ANT) < k 


Proof. Let M = p4(A*), and consider an idempotent e € M having image S, i.e., such 
that Qe = S. Consider an element t € T — S, and set s = te.Then s € S, and therefore, 
s #t. We will prove that there is an idempotent f separating s and t, i.e., such that 
sf Atf. 

According to Proposition fet there exists h € E(X) such that s-h A t-h. Let 
m = y(h) € J, where J is the minimal ideal of M. Multiplying on the right by a 
convenient element n € M, the element mn € J will be in the £-class characterized 
by the minimal image T. Since n realizes a bijection from Im(m) onto Im(mn) = T 
we have smn Æ tmn. Let f be the idempotent of the H-class of mn. Then f and mn 
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1 2 3 4 5 
a 1 4 52 3 
b 2 3 1 1 3 


Table 8.4: The Automaton A(X*) 


have the same nuclear equivalence. Consequently sf # tf. Since t € T = im(mn) = 
im(f) = Fix( f), we have tf = t. 

Consider now the restriction to T of the mapping ef. For all p€ SMT, we obtain 
pef = pf =p. Thus ef fixes the states in SAT. Further t(ef) = sf # t, showing that 
ef is not the identity oy 7’. Thus, by definition of k, we have Card(SNT) < k. 


Proof of Theorem et A = (Q,1,1) be the minimal automaton of X*. Let w € 
A* — A* X A* and set w = aja2--- ax with a; E A. Let S be a minimal image. For each 
i €1,---,k, the word aja2---a; defines a bijection from S onto S; = S - ayaq--- aj. 


Since S; is a minimal image, it contains the state 1. Thus Sp contains all the k + 1 
states 
1+ aj,a2°-+ap,1-ag-++ap,-++ l- ak, 1 


. These states are distinct. Indeed, assume that 
1- ajaj41 +++ ak = 1-aj--- ax 


for some i < 7, ‘Then setting q = 1- ajaj41---aj;_-1, we get q-aj---axy =1-a;--- ax. By 
Proposition inn implies q = 1. But then w € A*X A*, contrary to the assumption. 

Thus S-w contains k + 1 states which are determined in a way independent from 
S. In other words, if T is another minimal image, then T - w contains t ese, same 
k +1 states. This means that Card(T-wNS-w) > k, and by Lemma kti we have 


S- w = Ty., Thus two arbitrary minimal images have the same image by w. This 
shows by ( that w is in E(X). m 
Remark Consider, in Theorem #3 the special case where k = 0, i.e., where the group 


G(X) is regular. Then 1 € E(X). Now 


1€ E(X) = X is a group code. (8.4.4) 


Indeed, if 1 € E(X), then the syntactic monoid M = y 4(x»)(A*) coincides with its 
minimal ideal. 

This minimal ideal is a single group since it contains the neu ral glement of M. The 
converse is clear. Thus we obtain, in another way, Theorem B.J. 


EXAMPLE 4.1 If X is a thin maximal bifix code over A with degree d(X) = 3, the 
k =0 (if G(X) = Z/3Z) or k = 1 (if G(X) = G3). In the second case by Theorem #3) 
we have 


A-XCE(X). 


The following example shows that the inclusion A C E(X) does not always hold. Let X 
be the maximal prefix code over A = {a,b} defined by the auto aton A(X*) = (Q,1,1) 
with Q = {1, 2,3,4,5} and transition function given in Table i i 
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The set of images, together with the actions by a and b, is given in Fig. Bo 

Each of the images contains the state 1. Consequently X is a bifix code. We have 
d(X) = 3 (which is the number of elements of the minimal images). We have Q -b = 
{1,2,3}. Thus y,(b) has minimal rank; consequently b € E J» flowever, a ¢ E(X) 
since Q -a = Q. In fact a € X, in agreement with Theorem 1.3, 


THEOREM 4.5 Let X be a thin maximal bifix code. Then the code X is indecomposable 
iff G(X) is a primitive group. 


a) 
Proof. If X = Y o Z, then rh £ are thin maximal bifix codes by Proposition iio 


According to Proposition [VIJI-I, there exists an imprimiti vity partition 0 of G(X) 
such that G? = G(Y) and Gg = G(Y) and Gg = G(Z). If G(X) is primitive, then 
Go = 1 or G? = 1. In the first case, d(Y) = 1, implying X = Z. In the second case, 
d(Z) = 1, whence Z = A. Thus, the code X is indecomposable. E5 

The converse implication follows directly from Corollary VII.A. n 


b 


b a 

b ae 

a 12345 ———_»> 123 b 145 
‘ab OT 


Figure 8.9: The diagram of images. 


5 Depth 


Let S be a finite semigroup, and let J be its minimal (two-sided) ideal. We say that S 
is nil-simple if there exists an integer n > 1 such that 


S cI (8.5.1) 
i 
The smallest integer n > satisfying (sam called the depth of S. Since S” is, for all 


n, a two-sided ideal, is equivalent to S” = J, which in turn implies S” = $”+! 


PROPOSITION 5.1 Let S be a finite semigroup. The following conditions are equivalent. 
(i) S is nil-simple. 
(ii) All idempotents of S are in the minimal ideal J of S. 


Proof. (i) = (ii) Let n be the depth of S. For any idempotent e in S, we have e = e” € J 

(ii) > (i) Set n = 1+ Card(S). We show the inclusion S” C J. Indeed let s € S”. 
Then s = 5182--- Sn, with s; E S. Let ti = 5159---s;, for 1 > i > n. Then there exist 
indices i,j with 1<i<j<nandt; = tj. Setting r = s;,1---8;, we have 


tir = ti, 


hence also t;r* = t; for all k > 1. Since S is finite, there exists an integer k such that 


e = r* is an idempotent. Then e € J, and consequently 


S = tiSi+1 °t Sn = tiesit Sn EJ 
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This proves that S' is nil-simple. n 


We shall use nil-simple semigroups for a characterization of bifix codes. Before stating 
this result, we have to establish a property which is interesting in itself. 


PROPOSITION 5.2 Let X C At be a thin maximal bifix code, and let p be the repre- 
sentation associated to an unambiguous trim automaton recognizing X*. Let J be the 
minimal ideal of p(A*). Then 

g(H(X)) cJ 


Recall that H(X) = ATXA" is the set of internal factors of X, and H(X) = A*—H(X) 
Proof. Let pp be the representation associated with the flower automaton of X, set 
Mp = ¢p(A*) and let Jp be the minimal idea gt Mp. It suffices to prove the result for 
pp. Indeed, there exists by Proposition VINL., a surjective morphism ô : Mp — y(A*) 
such that y = fo yp., we have ô(Jp) = J. 

Thus the inclusion yp(H(X)) c Jp implies y(H(X)) c (Jp) = J. Thus it remains 
to prove the inclusion yp(H(X)) C Jp. 

Let Ap = (Q, (1,1)(1,1)) be the flower automaton of X. Let w € H(X). Then w has 
d = d(X) interpretations. We prove that rank(yp(w)) = d. Since this is the minimal 
rank, it implies that yp(w) is in Jp 

Clearly rank(yp(w)) > d. To prove the converse inequality, let I be set of the d 
interpretations of w. We define two relations 


a € {0, l}gxr, 8 € {0, 1}rxQ 
as follows : if (u,v) € Q, and (s,z,p) € I, with s € ATX, x € X*, pe XA, then 


((u, v), Q, (s, x, p)) = ON 
((s, £, p, ), B, (u, v)) Öp,u 


where ô is the Kronecker symbol. We claim that 


ppw) = ap. (8.5.2) 


Assume first that (u,v)a(u’,v’). Then there exists an interpretation i = (v, x,u’) € I 
such that (u,v)aiG(u’,v’). Note that i is uniquely determined by v or by u’, because 
X is bifix. Next w € vX*u’, showing that ((u, v), pp(w), (u’,v’)) = 1. 

Conversely, assume that ((u, v), pp(w), (u’,v’)) = 1. Then either ww = u’ and v = 
wv’, or w E€ vX*u'. The first possibility implies the second one: Indeed, if uw = u’ and 
v = wv’, then uwv’ € X. Since w € H(X) this implies u = v’ = 1 =u! = v. It follows 


that w € vX*u'. Thus, w = vru’ for some x € X*, showing that. 7 5 (vz 1) is an 
interpretation of w. Thus, (u,v)ai and iG(u’,v’). This proves ( . By ERP ve 


have rank pp(w) < Card(I) = d(X). m 
The following result gives, a ebraic characterization of finite maximal bifix codes. 
The proof uses Theorem [M.f on codes with finite deciphering delay. 


THEOREM 5.3 Let X C At be a finite mazimal code, and let A be an unambiguous 
trim automaton recognizing X*. The two following conditions are equivalent. 
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(i) X is bifia, 
(ii) the semigroup y,4(A*) is nil-simple. 
Proof. Set p = ya, and set S = y(AT). Let J be the minimal ideal of S. 
(i) (ii) Let n be the maximum of the lengths of words in X. A word in X of length n 


cannot be an internal factor of X. Thus A” A* C H( Qhserve that A” A* = (AT)". 
Thus, S” = y((At)”) c y(H(X)). By Proposition B-4 we obtain S” C J, showing 


that S is nil-simple. 
GC ODO 


Figure 8.10: The minimal automaton of (a*b)*. 5_02 


(ii) => (i) Let n be the depth of S. Then for all y € A” A* = (AT)”, we have y(y) € J. 
We prove that for any y € X”, and for all x € X, u € A*, 


zyu € X* => yu € X*. (8.5.3) 


The semigroup S contains no zero. Further, the elements y(y) and y(yry) of p(X *) 
are in the same group, say G, of the minimal ideal, because y(yzy) = y(yx)y(y) and 
oly) = [v(yx)]~!v(yry), showing that y(y)Ly(yxy). The same argument holds for the 
other side. In fact, both y(yx) and y(yx)~! are in the subgroup GN y(X*). Thus there 
exists some r € X* such that [y((yx)|]~! = y(r), or also y(y) = v(r)y(yzry). 


This gives 
plyu) = p(r)oly)plzyu) € p(X"), 
showing that X*. This proves ( = 
Formula shows that the code ap finite deciphering delay (from left to right) 
at most n. According to Theorem [Id], X is a prefix code. Symmetrically, X is suffix. 
Thus X is a bifix code. n 


:2: 1 
EXAMPLE 5.1 Consider again the maximal bifix code X of Example EF The semi- 


group pax (AT) is not nil-simple. Indeed, y(a) is a permutation of Q and thu 3 
p(a”) g J of for all n > 1. This shows that the implication (i) > (ii) of Theorem b.3f 
is, in general, false, without the assumption of finiteness on the code. 


EXAMPLE 5.2 Let A = {a,b} and X = a*b. The code X is maximal prefix, but is not 
suffix. The automaton A(X*) is given in Fig. 


The semigroup (A+) is nil-simple: it is composed of the two constant functions y(a) 
a: $b). This example shows in addition that the implication (ii) > (i) of Theorem 
may become false if the code is infinite. 


6 Groups of finite bifix codes 

the case o a thin „maximal bifix code X, the £-representation, introduced in Chapter 
Ri (Section Bf of the minimal automaton of X* takes a particular form which makes 
it easy to manipulate. 
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Consider a thin maximal bifix code X C A* of degree d, let A(X*) = (Q,1,1) be 
the minimal (deterministic) automaton of X* and let y = pa(x+) be the associated 
representation. Finally, let M = y(A*) and let J be the minimal ideal of M. Each 
H-class of J is a group. Fix an idempotent e € J, let S = im(e) = Fix(e), and let T 
be the set of H-classes of the £-class of e. Denote by ey the idempotent of the H-class 
H eT. The set of pairs (ey,e)yer constitutes a system of coordinates. Indeed, for 
Hel, 

ee = eH, eeH = e. 


If e = cl is the column-row decomposition of e, then for H € T, ey = cyl, wi hg 3 
CH = eqc, is the column-row decomposition of ey. The notations of Section VI. en 
simplify considerably. In particular, for m € M and H ET, 
m x* H = lmcy = l(emep)c. 
Of course, m x H € Ge. This can be used to define a function 
A* x E(X) > Ge, 


where E(X) = 71 (J) as in the previous section. Let u € A* and let k € E(X). Then 
(k) € J, and corresponding to this element, there is an H-class denoted H (k) inr 
which by definition is the intersection of the R-class of y(k) and of the L-class of e. In 
other words, H®) = Men y(k)M. 

The function from A* x E(X) into Ge is then defined by u * k = y(u) x H™), Thus 


ux k =Ilp(u)eqm = lep(u)e yac. 


Thus u x k € Ge. It is a permutation on the set S = Fix(e) obtained by restriction to 
S of the relation ep(u)e p). 

The following explicit characterization of u * k is the basic formula for the computa- 
tions. For u € A*, k € E(X), we have for s,t € S, 


s(uxk)=tes-uk=t-k (8.6.1) 


In this formula, the c tation of s-uk and t-k is of course done in the automaton 
A(X*). Let us verify ( . If s(uxk) = t, then sey(u)eyu),=t. From se = s, it 
follows that sp(u)epaœ) =t. Taking the image by y(k), we obtain 


sp(u)jenm p(k) = tylk). 


Since epo) p(k) = p(k), we get that sp(uk) = ty(k), or in other words, s- uk = t- k. 
Conversely, assume that sy(uk) = ty(k). Let m € M such that y(k)m = ep). 
Then sy(u)y(k)m = ty(k)m implies sp(u)e pga) = tega). Since se = s and te = t, 
We get 
sep(u)ep(n) = teega), = te =t, 


Ei 
showing that s(u* k) = t. This proves Bt 
The function from A* x E(X) into Ge defined above is called the ergodic representation 
of X (relative to e). 
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1 
We will manipulate it via the relation (Bay. Note the following foyrgulas which are 
the translation of the corresponding relatio S given in Section 2, and which also can be 
simply proved directly using formula ( 
For u € A*,k € E(X), and v € A*, 


uxkv = uxk, (8.6.2) 
uvxk = (uxvk)(u*k). (8.6.3) 


PROPOSITION 6.1 Let X C A* be a thin maximal bifiz code, and let R = E(X) — 
E(X)A™ be the basis of the right ideal E(X). Let e be an idempotent in the min- 
imal ideal of pax») (A*) and let S = Fix(e). The group G(X) is equivalent to the 
permutation group over S generated by the permutations axr, witha E€ A, re R. 


Proof. It suffices to show that the permutations a*r generate Ge, since Ge is equivalent 
to G(X). Set » = yacx~). Every permutation u* k, for u € A* and k € E(X), clearly 
is in Ge. Conversely, consider a permutation o € Ge. Let g E€ G(e) be the element 
giving o by restriction to S,and let u € y~!(g), k € yt (e) Then u k is the restriction 
to S of ep(u)er(n) = ep(uje = g. Thus u* k =v. 

Thus Ge = {u xk | u € A*,k € E(X)}. For u = aiaz2:--an with a; € A, and 
k € E(X), we get, by (4.3), 


ux k = (aq * azaz +: : ank) (ag * a3 +++ ank): +: (an * k). 


Thus G, is generated by the permutations a x k, for a in Aa q k in E(X). Now for 
each k in E(X), there exists r € R such that k € rA*. By ( > we have axk = axr. 
This completes the proof. n 


.4.1 8.6 
Note that Proposition Bat can also be derived from Proposition ipo 


PROPOSITION 6.2 Let X be a finite maximal bifix code over A of degree d and let 
p = pa(x+)- For each lettera € A, we have at € E(X)NX and y(a®) is an idempotent. 


3.1 
Proof. Let A(X*) = (Q,1,1). By Proposition PB- we have af € X fora € A. The 
states 
Lege: lege 


are distinct. Indeed, if leg =1- af for some 0<i<j<d-—l1, th Lsetting p=j-1, 
we would have 1- a’ = (1-a?)- a’, and in view of Proposition E this would imply 
1- a? = 1, whence a? € X* with 0 < p < d, contradicting the fact that a € X and X 
is prefix. Moreover, we have 


Im(a*) = Q- af = {1,1-4,...,1-a7-"}. 


Indeed, let q E Q,q #1, Let w € XA” be a word such that 1-w = q. Since X is 
right complete and finite there exists a power of a, say af, such that wa? € X. Then 
j < d since X is suffix, and j > 0 since w ¢ X. Thus q -af =1landq-at?=1-atJe 
1,1-a,ldots,1-a¢1. This proves that Im(af) C 1,1-a,...,1-a4!. The converse 
inclusion is a consequence of (1 - at) -af = 1 - a? = 1 - af, for i = 0,...,d — 1. 
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Thus g(a?) has rank d, showing that (af) is in the minimal ideal of y(A*), which 
in turn implies that af € E(X). Next (1-a/)-a4=1-a/ for j = 0,...,d — 1 ; thus 
y(a*) is the identity on its image. This proves that (af) is an idempotent. n 

Proposition ‘Shows that in the case of a finite maximal bifix code X, a particular 
ergodic representation can be chosen by taking, as basic idempotent for the system of 
coordinates, the d(X)th power of any of the letters a of the alphabet. More precisely, 
let 

A(X") = (Q,1,1) 
and let y be the associated morphism, set e = (af) with the convention that for 
l<i<d, 
i=1- a! 
The ergodic representation relative to the idempotent (af) is denoted by *a. It is 
defined, for u € A*, k € E(X), and for 1 < i,j < d, by 


HU, bk) =j & i- -uk=j-k 
& 1-aluk=1.aftk. (8.6.4) 


Observe that for u = a and for any k € E(X), 
(=a (8.6.5) 


4 
with a = (12-+-d). Indeed, by (B64) tarak) = j iff i-ak = j-k, thus iff (i+1)-k = j-k. 
Since k induces a bijection from S onto S- k, this implies j = i + 1, which is the claim. 


1 
5 


Sa 
E 
ee 
a 


1 


we 
2 = 
bW 
ae 


Figure 8.11: Transitions for a bifix code. 


EXAMPLE 6.1 Let A = {a,b}, and consider the finite maximal bifix code X C A* of 
degree 3 with kernel K(X) = {ab}. The transitions of the minimal automaton of X*, 
with states {1,2,3,4,5}, are given in Fig. ; 

The letters a and b define mappings y(a) and (b) of rank 3.Thus a,b € E(X). We 
consider the ergodic representation *a, i.e. relative f A ooe e = (a). To 
compute it, it is sufficient (according to Pr sjtion O compute the four permu- 
tations a*a a, a *a b, b *a a, b *a b by using ( . For instance, we have i(a*,a) = j & 
?2— j.a i+1= j mod 3. The permutations are easily seen to be 


ak; Q = A *a b = (123), b *a a = (12), b xa b = (132) 


i-a 


The group G(X) therefore is the symmetric group over S. 
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PROPOSITION 6.3 Let X C AT be a finite maximal bifix code of degree d, and leta € A. 
Then a xa a? is a cycle of length d. 


Proof. By (Sc 


This is equivalent to i-a = j, or i +1 = j mod d. Thus i(a*, a?) = i+1 mod d, proving 
the statement. : 


ila xa af) = j&i- a! =7- at. 


We are now ready to study the groups of finite bifix codes. We recall that a transitive 
permutation group G of degree d > 2 is called a Frobenius group if k(G) = 1. 


THEOREM 6.4 Let X be a finite maximal bifix code of degree d > 4. Then G(X) is not 
a Frobenius group. 


Proof. Let A = (Q,1,1) be the minimal automaton of X*. Since d > 4, no letter is 
in X. Arguing by contradiction, we suppose that G(X) is a Frobenius group. Thus 
k(G(X)) = 1. By Theorem 2.3, we have A C E(X). This means that for all a € A, 
im(a) has d elements. 

Let a € A be a letter, and set S = im(a®) = {1,2,...,d}, where, for 1 < i < d, 
i =1-a’!. Consider the ergodic representation *,, and set 


Q = a *a Q, p = *aa, 


where b € A is an arbitrary letter. We want to prove that 


B=a. (8.6.6) 
Note that, by Eei and BE we have fori € S, 
i- ba = iß - a, 
; i+1 ifi< d 
A { 1 ifi=d 


Since S-b is a minimal image, it contains the state 1. Thus there exists a (unique) state 
qd € S such that q' -b = 1. For the same reason, there exists a unique state q” € S such 
that q” -ba = 1. We claim that q'3 = 1,q” 8 = d. Indeed, we have 1-a = q' -ba = q' 8-a. 
Next q” -ba=q"B-a=1=d-a. 
Since a defines a bijection from S onto itself, it follows that 1 = q'G and q’G = d. 
This proves the claim. 
Now we verify that 
q@>q for qES, qd. (8.6.7) 


First, we observe that the inequality holds for q”, since q’3 = d. Thus, arguing by 
contradiction, suppose that q8 = p < q for some q € S,q#q',q". Then 


qG-a=q-ba=p-a=p+1<q. 
Setting n = q — (p +1), it follows that q- bat! = q. Consider the path 


bar +1 
q — q 
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Since q £ q',q”, we have q-b £ 1,q-ba £1. Also q- bat =p+iF1 fori=1,...,n4+1. 
Thus this path is simple. Consequently, 


al 1(ba”t!)*at alic x 


contradicting the finite eps of X. This proves (Re 

It follows from hat there exists at most one state q E€ S such that qG < q, 
namely the state q’. This implies that the permutation 3 is composed of at most one 
cycle (of length > 1) and the rest are fixed points. Further, 3 cannot be the identity on 
S, since otherwise the relation q’3 = 1 would imply q’ = 1, hence 1-b = 1 and b € X 
which is not true. Now by assumption, G(X) is a Frobenius group. see has at most 
one fixed point. If 3 has no fixed point, then the inequalities in ( are strict and 
this implies that 

6 = (123---d) =a. 
Assume now that 8 has just one fixed point i. Then 8 = (123---i— 1i+1---d)(?). 
This implies that 
Btlea=Gi+1) if iFd, 
Bra=(dl) if i=d. 


Since 67ta € G(X) and 87ta has d — 2 fixed points, G(X) can a Frobemus group 
( : 


only if d < 3. > jel s a contr, iction and proves formula 
It follows from ( and that i- ba = i - a? for i € S. This shows that for 


m > 0, 
1- a”ba = 1- a™t?, (8.6.8) 


Observe that this formula holds for arbitrary letters a,b € A. This leads to another 
formula, namely, for i > 0 and a,b € A, 


ab =1 bH, (8.6.9) 


This formula holds indeed for a,b € A andi = 0. Arguing by induction, we suppose 
that ( olds for some i > 0, and for all a,b € A. en we have, for a,b € A, also 
1-b'a = 1 - attt, whence 1- b'ab = 1-a't1b. Apply ( We get 


1-a't1pb =1-b'ad =1- bt? 


9 
This proves (SSR! 


Finally we show, by a descending induction on i € {0,1,...,d}, that for all a € A, 
Leg AY = {1} 
This holds for i = d, and for i < d we have 


1- at Att = (J 1- abat = Ui pit! qd-t-1 — 1 
bEA beA 


x) 
by using (SCH This proves the formula. For i = 0, it becomes 1- A? = {1}, showing 
that A? C X. This implies that AY = X. Since G(A%) is a cyclic group, it is not a 
Frobenius group. This yields the contradiction and concludes the proof. n 
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Remark Consider a finite maximal bifix code X of degree at most 3. If the degree 
is lor2, then the code is uniform, and the group is a cyclic group. If d(X) = 3, then 
G(X) is either the symmetric group 63 or th qlig roup over 3 elements. The latter 
group is regular, and according to Theorems paa BR code X is uniform. Thus 
except for the uniform code, all finite maximal bifix codes of degree 3 have as a group 
63 which is a Frobenius group. 

We now establish an interesting property of the groups of bifix codes. For this, we 
use a result from the theory of permutation groups which we formulate for convenience 
as stated in Theorem 4.5 (proofs can be found in the classical literature and references 
are given in Section 7). Recall that a permutation group G over a set Q is k-transitive 
it for all (p1,..., pp) € QF and (q1,..., qk) € QF composed of distinct elements, there 
exists g € G such that pig = q1,...,perg = qk. Thus 1-transitive groups are precisely 
the transitive groups. A 2 transitive group is usually called doubly transitive. 


THEOREM 6.5 Let G be a primitive permutation group of degree d containing a d-cycle. 
Then either G is a regular group or a Frobenius group or is doubly transitive. 


THEOREM 6.6 Let X be a ~ maximal bifix code over A. If X is indecomposable 
and not uniform, then G(X) is doubly transitive. 


Proof. According to : 4 Eg EJE group G(X) is primitive. Let pg its degree. In 


view of oo ) contains a peo By, T gopem ; three cases may 
arise. Eith G(X) is o and then, by Theorem A is a group code and _þ i 
Theorem e code X is uniform. Or G(X) is a Frobenius group. By Theorem 

we have d < 3. The only group of a nonuniform code then is G3, as shown in the 
remark. This group is both a Frobenius group and doubly transitive. Thus in any case, 


4.4 


the group is doubly transitive. n 
4.6 baa . : 
In Theorem Bq th tion on X to be indecomposable is necessary. Indeed, 
otherwise by Theorem [df the group G(X) would be jm rimitive. But it is known that 
a doubly transitive group is primitive (Proposition Hisar 


There is an interesting combinatorial interpretation of the fact that the group of a 
bifix code is doubly transitive. 


PROPOSITION 6.7 Let X be a thin maximal bifix code over A, and let P= XA”. The 
group G(X) is doubly transitive iff for all p,q E€ P — {1}, there exist x,y € X* such 
that 


px = yq. 


Proof. Let p be the representation associated with the literal automaton A = (P, 1,1) 
of X*. Let d = d(X), and let e by an idempotent of rank d in y(X*). Let S = Fix(e). 
We have 1 € S, since S = im(e). 

Let p,q E S — {1}, and assume that there exist x,y E€ X* such that pr = yq. We 
have 1-p=p and 1-q=gq, whence 


p-x=1-pr=1-yq=l-q=¢q. 


This shows that for the element ey(x)e € G(e), we have pey(x)e = q. Since ley(x)e = 
1, this shows that the restriction to S of ey(a)e, which is in the stabilizer of 1, maps 
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p on q. Thus this stabilizer is transitive, and consequently the group Ge = G(X) 
is doubly transitive. Assume now conversely that G(X) is doubly transitive, and let 
p,q E€ P—1. Let i,j E€ S be such that pe = i,qe = j. Then i,j #4 1. Consider indeed 
a word w € vy t(e). Then 1-w = 1; thus the assu tion i = 1 would imply that 
p:-w = pe = i = 1, and since 1- w = 1, Proposition Pacis p = 1, a contradiction. 
Since G(X) is doubly transitive, and G(X) is equivalent to Ge there exists g € G(e) 
such that 
ig= J and lg =1. 
Let m E€ y(A*) be such that jm = q, and let f be the idempotent of the group G(em). 
Since e and f are in the same R-class, they have the same nuclear equivalence. Thus the 
equalities ge = j = je imply qf = jf. Further im(f) = im(em). Since gem = jm = q, 
we have q € im(f). Thus q is a fixed point of f, and jf = qf = q. Consider the 
function egf. Then 
legf = lgf =1f =1, 


pegf =igf =jf =4. 


Let x be in y ‘(egf). Then x € X* and p- x = q. This holds in the literal automaton. 
Thus there exists y E€ X* such that pz = yq. n 


7 Examples 


ction5.4 
The results of Section Fishow that the groups of finite maximal bifix codes are particular 
ones. This of course holds only for finite codes since every transitive group appears as 
the group of some group code. We describe, in this section, examples of finite maximal 
bifix codes with particular groups. 
Call a permutation group G realizable if there exists a finite maximal bifix code X 
such that G(X) = G. We start with an elementary property of permutation groups. 


LEMMA 7.1 For any integer d > 1, the group generated by a = (12---d) and one 
transposition of adjacent elements modulo d is the whole symmetric group © q. 


Proof. Let 8 = (1d). Then for j € {1,2,...,d— 1}, 
a Bad = (j,j +1). (8.7.1) 


Next for 1 <<i<j<d, 
(i, j) = TG E ign, 
where 
7=(t,t+1)04+1,14+2)---G—2,7-1). 


This shows that the group generated b and 8 contains all transpositions. Thus it is 
the symmetric group Gg. Formula ( “shows that the same conclusion holds if 3 is 
replaced by any transposition of adjacent elements. 


PROPOSITION 7.2 For alld > 1, the symmetric group Gq is realizable by a finite maz- 
imal bifix code. 
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Proof. Let A = a bor d= 1, oy 2, the code X = A? can be used. Assume d > 3. 
By Theorems in eee exists a unique maximal bifix code X of degree 


d with kernel 
K = {ba}. 


Indeed, u(K) = (Lx,ba) = 2. No word has pe than one K-interpretation. Conse- 


quently K is insufficient and by Proposition he code X is finite. Let us verify 
that 


X Na*ba* = ba U {a*bat* | 1 < i < d—2}Uat"d. (8.7.2) 


For each integer j € {0,1,...d— 1}, there is a unique integer 7 € {0,1,...d—1} su 5 
that aba? € X. It pa to verify that the integer 7 is at ‘i EN ( 


ada {0,1,...,d — 1} be such that abaf € X. By formula ( yin Chapter 
pi the 


number of X-interpretations of atbaf is 
(Lx,abaf) = 1+ |a’ba?| — (A*XA*,a‘ba’) 
= i+j+2-(A*XA*, aba). 
The number (A*XA*,a’ba’) of occurrences of words of X in abaf is equal to 1 plus 


the number of occurrences of words of K in abaf, except when j = 1 which implies 
i = 0 since ba € X. Thus 


i+j if iec {1,2,...,d— 1}, 


ibaj) — 
(Exata) =d iti if i=0. 


On the other hand, the word_a‘bg/.must have d interpretations since it is not in K = 
K(X). This proves formula . Now we consider the automaton A(X*) = (Q,1, 1) 
and consider the e ggodic representation *, associated to the idempotent (a?) defined 


in Section Rs Setting i=1-a*' for i € {1,2,...d} we have 
a xa a? = (12---d). 
.4 
Set 3 = bea“ and observe that 3 = (1d). Indeed by formula pim 


iß=j © 1-a bat = 1. atat 


& 1-a tbat =1-a)7! 


Thus i@ = 1-a tbat. For i = 1, this gives 18 = 1-baa?—!, whence 18 = 1-aù™t = d. 
Next for i = d, we have dG = (1 -aù™t) -at = 1. Finally, if 1 < i < d, then i@ = 
1-a tba} gi! = 1-a'-! =i. This shows that the group G(X) contains the cycle 


a= (12---d 


5. 
and the transposition 8 = (1d). In view of Lemma Aretx j= 
For the next result, we prove again an elementary property of permutations. 


LEMMA 7.3 Letd be an odd integer. The group generated by the two permutations 
a= (1,2,...,d) and y= ðað, 


where ô is a transposition of adjacent elements modulo d, is the whole alternating group 
Aq 
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Proof. The group Mq consists of all permutations o € Gq which are a product of an 
even number of transpositions. A cycle of length k is in 24 iff k is odd. Since d is odd, 
a, y E Ag. ‘oa 
By Lemma Fit tre symmetric group is generated by a and 6. Each permutation 
o € Gg can be written as 
o = aĝas.. a_a" 


and o € U4 if n is odd. In this case, setting n = 2m + 1, 


k k: k 
o =a" baa ° Ba -+ + BomAm41 


with 3; = da*'5. Since bz; = (að) , this formula shows that %q is generated by a 
and dad = y. n 


PROPOSITION 7.4 For each odd integer d, the alternating group Aq is realizable by a 
finite maximal bifix code. 


Proof. Let A = {a,b}. For d = 1 or 3, the code X = Aĉ can be used. Assume d > 5. 
Let E 
ie eee JT = 1123s dy; 
and 
Q=r0} 


Consider the deterministic automaton A = (Q, 1,1) with transitions given by 


ica =i+1 (1<i<d-1), 
d-a =1, 
i-a =i+1 (1<i<d-—2), 
d-l-a =1, 
d-a =d, 
and, 
i-b =i41 (i <3<d—3), 
(d—2)-b =d, 
(d—1)-b =d, 
(d—1)-b =d-1, 
d-b = 1, 
i-b =i+l (2<i<d-1), 
deb =1 


It is easily verifed that this automaton is minimal. Let X be the prefix code such that 
|A| = X*. Since 

I-a=TI, J-a=IT, 

I-b=J, J-b=J, 


the functions (a) and (b), of rank d, have minimal rank. Since J apq J are the 
only minimal images, and since they contain the state 1, Proposition pi ii ii) shows that 
X is maximal bifix code. It has degree d. 
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Let us show that X is finite. For this, consider the following order on Q : 
1<2<2<3<3<---d-1l<d-—-I1 and d-1l<d<d 


For all c € {a,b} and q € Q, either g-c = 1 or g:c >q. Thus, there are only finitely 
many simple paths in A. Consequently, X is finite. 

Now let us compute a: i qojnce (a), (b) have minimal rank, both a,b € E(X). 
According to Proposition b.i, the group G(X) is equivalent to the group generated by 
the four permutations 

a*a, Qžaąab, bxga, bž*ab 


.5 
E ee Pa have ax,a = a*ab = a , with a = (1,2,...d). Next, by formula 


bxaa =a, b*xab=Ņ 


with y = (1,2,--- ,d— 3,d — 1,d — 2, d). In view of Lemma EF GtX) = a 

Observe that for an even d, the group Ma is not realizable. Mor erally, no 
subgroup of 24 is realizable when d is even. Indeed, by Proposition ees group 
G(X) of a finite maximal bifix code X contains a cycle of length d which is not in Wa 
since d is even. 


EXAMPLE 7.1 We give, for d =5,_the figures of the automaton and of the code of the 
previous proof, Figs. and Br 


ee 4 
ae 5 9 
5 3s my 
Nea ee 
= 5O O 
fag ~ 


Figure 8.12: A finite maximal bifix code X with G(X) = As. 


EXAMPLE 7.2 For degree 5, the only realizable groups are Z/5Z,©5 and 5. It is 
known indeed that with the exception of these three groups, 2 transitive permutations 
groups of degree 5 are Frobenius groups. By Theorem “they are not realizable. 


EXAMPLE 7.3 For degree 6, we already know, by the preceding propositions, that 
Z/6Z and G¢ are realizable. We also know that no subgroup of Xe is realizable. There 
exists, in addition to these two groups, another primitive group which is realizable. 
This group is denoted by PG'L2(5) and is defined as follows. Let P = Z/5Z U oo. The 
group PGL2(5) is the group of all homographies from P into P 


zp +y 
ke 
zp+t 
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Figure 8.13: The automaton A(X*). 5_05 


for 2,y,z,t € Z/5Z satisfying zt — yz # 0. Consider, for later use, the permutations 
a = (0001423), 6 = (0010243). 
We have a, 8 € PGL2(5). Indeed a and 8 are the homographies 


Bip Pe 
a: ph —, pre 
Pr p+2 Pe +2 

respectively. We verify now that a and 8 generate all PGL2(5). A straightforward 
computation gives 


Bab = (000421)(3), 6?aB8a~ = (00) (4)(0123) 


The permutation a together with these two permutations show that the group G gen- 
erated by a and £ is triply transitive. Now each element ø in PGL2(5) is characterized, 
as any homography, by its values on three points. Since G is 3-transitive, there exists 
an element g € G which takes the same three values on the points considered. Thus 
o = g, whence o € G. This proves that G = PGL2(5). 

To show that PGLə(5) is realizable, we consider the automaton A = (Q, 1,1) given 
in Table [7.3 This automaton is minimal. Let X be the maximal prefix code such that 
A= A(X*). Then X is a finite maximal bifix code. Indeed, the images 


im(a) = {1,2,3,4,5,6}, im(b) =41,2,3,4,5, 6} 


2.1 
are minimal images, containing both the state 1. By Proposition Fr i ny), X is maximal 
bifix with degree 6. The code X is finite because if Q is ordered by 


1<2<2<3<3<4<4<5<5<6<6, 
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a 


Table 8.5: The Transitions of the Automaton A 


n| = 
A w| N 
Wl Aj vw 
Ol o ke 
Ou Ojo 
= =| o 
w WY] byl 
A OU Ol! 
ou ej BY 
DI FY] ou 
re oj Dl 


NI 


then the vertices on simple paths from 1 to 1 are met in strictly increasing order, 
with the exception of the last one. Next a,b € E(X), because of the minimality of the 
images im(a),im(b). Thus the group G(X) is generated by the permutations 


Q = 64,0 = A *a b= (123456), 


4 
. Formula BE shows that 


y=bega, B=b*ab 


y=a, B= (132546) 


Thus G(X) is generated by a and 3. Let p be the bijection from 1,2,3,4,5,6 onto 
P = Z/5Z U œ given by 


Then a = p-tap = 67t6Bp. Consequently, the groups G(X) and PGL2(5) are equiv- 
alent. 


8 Exercises 


: section5.0 
Section i 
1.1 Let X C At be a maximal prefix code. Let 


R= {r € A* | Vx € X*, Jy € X* : rey € X*}. 


(a) Show that R is a right unitary submonoid containing X*. 
(b) Let Z be the maximal prefix code such that R = Z* and set 


X=Y oZ: 


Show that if X is thin, then Y is synchronized. 
(c) Show that if X = Y’ o Z’ with Y’ synchronized, then 


7° C2, 


(d) Suppose that X is thin. Let A = (Q,1,1) be a deterministic trim automaton 
recognizing X* and let y be the associated representation. Show that a word 
r € A* is in R if and only if for all m € y(A*) with minimal rank, 


1-r =1 mod Ker(m) 
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g section5.1 
Section 


2.1 Let X C A* be a finite code and let A = (Q,1,1) be an unambiguous trim 
automaton recognizing X*. Show that the group of invertible elements of the monoid 
pA(A*) is a cyclic group. 


2.2 Let G be a transitive permutation group over a finite set Q. Let 1 € Q, and 
H = {g € G | 1g = 1}. Let Y : A* — G be a surjective morphism and let Z be the 
group code defined by Z* = W~-1(H). Let X be a finite subset of Z. Let y be the 
representation associated with the minimal automaton A(X*) and set M = y(A*). 

Show that M contains an idempotent e such that the group Ge is equivalent to G if 

there exists a finite subset Y of A* such that 

(i) ZN A*Y A* is finite, 

(i) ZA F(Y*)c X, 

(iii) W(Y*)=G. 
Use this to show that for every finite transitive permutation group G, there exists 
a finite bifix code X such that G is equivalent to Ge for some idempotent e in the 
transition monoid of the minimal automaton of X*. 


. Isection5.2 
Section 


3.1 Let X C At bea bifix code and let A(X*) = (Q,1,1) be the minimal automaton 
of X*. Let y = yy be the associated representation and M = y(A*). Show that 


for any idempotent e € y(F(X)), the monoid of partial functions Me is composed of 
injective functions. 


. section5.3 
Section 


4.1 Let X C A% be a thin maximal bifix code, and let Jp be the minimal ideal of 
pp(A*). Show that for Card(A) > 2, 


H(X) = yp! (Jp) 


. : ignesMax 
Hint: use Exercise [VIBA 


4.2 Let X C A* bea finite maximal prefix code, let A(X*) = (Q, 1,1) be the minimal 
automaton of X*, set p = yacx*). Let a € A and let n be the order of a in X (a” € X). 


(a) Show that the idempotent in p gt) has rank n. 
(b) Show, without using Theorem b.3fthat y(At) is not nil-simple when n > 1 + 
d(X). 


4.3 Let X C At be a thin complete code. Then X is called elementary if there exist 
an unambiguous trim automaton A = (Q, 1,1) recognizing X* such that the semigroup 
S = y4(At) has depth 1. Show that if X is elementary, then 


X=Y 0Z, 


where Y is an elementary bifix code and G(X) = G(Y). 
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4.4 Let A = (Q,1,1) be a complete, deterministic trim automaton and let y be the 
associated representation. Suppose that (A+) has finite depth, and that y(A*) has 
minimal rank 1. Show that the depth of ~(A*) is at most Card(Q)—1. (Hint: Consider 
the sequence 0; of equivalence relations over Q defined by 


p=q mod; iff Vwe A, p-w=q-w.) 


exo5.3.5| 4.5 Let X C A* bea finite bifix code. Let p be the representation associated with 
A(X*) and M = y(A*). Let J be the minimal ideal of M and let A be the set of its 
£-classes. Let Lo be a distinguished £-class in A. Define a deterministic automaton 


B= (A, LoLo) 
by setting 


L-w=Ly(w) 


Let W be the representation associated with B, and let J be the minimal ideal of 
W(A*). 

(a) Show that (4%) hag minimal rank 1, and that PHI) =p 47). 

(b) Use Exercise o Show that y(A*) has depth at most Card(A) — 1. 


Isection5.4 
Section 


exo5.4.1]} 5.1 Let X be a finite maximal bifix code of degree d. Let a € A and k > 0 such that 
a’ € E(X). Show that for each integer n < d—k and each word u € A”, there exist at 
least d — k — n integers 7,1 <i < d such that 


i(uxga") >i-k+1. 


-1.1 5.4.1 
exo5.4.2} 5.2 Derive directly Theorem fen 1} trom Exercise [eet (ake k= 0). 
at 5.4.1 
exo5.4.3] 5.3 Derive the inequalities (Bb from Exercise Bat (take k=1,u =b). 


exo5.4.4| 5.4 Let G be a permutation group of degree d, let k = k(G) and suppose that G 
contains the cycle a = (12---d). Show that if d > 4k? +8k +2, then every m € G which 
is not a power of a has at most d — 2k — 2 excedances (an excedance of a permutation 
m of {1,2,...,d} is a value į such that ir > i). 


| bzn5-4eńbhė3| 5.5 Let X be a finite maximal bifix code of degree d and let k = k(X). Show that if 


d > 4k? + 8k + 2, then 


1. for each a € A and w € A*, the permutation 7 = w *, af 


is in the subgroup 
generated by a = a xa af. 
Hint: show that the permutation ma? has at least d — 2k excedances and use 
Exercise 4.4). 

2. X does not contain words of length less than or equal to k. 


5.4. 5.4.4 
exo5.4.5| 5.6 Derive from Exercises Bafana pa that a finite maximal nonuniform biprefix code 
X of degree d satisfies 


k(X) > (Vd/2) — 1. 
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Table 8.6: A Finite Code with Group GL3(2) 


1 2 8 4 5 6 7 8 9 10 11 12 13 14 15 

a 23 4 5 6 7 1 9 4 #14 15 133 1 6 7 

b 8 9 12 11 10 1 13 9 10 11 12 13 1 12 13 
Table 8.7: A Finite Code with Group M11 

1 2 3 4 5 6 T 8 9 10 11 12 13 14 

a 2 3 4 5 6 7 8 9 10 11 1 22 23 24 

b 12 13 16 17 14 15 #20 #19 18 1 21 13 14 15 


15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 
25 26 27 28 29 21 1 4 5 6 7 8 9 10 ll 
16 17 18 19 20 21 1 14 15 16 17 18 19 20 21 


. section5.5 
Section 


6.1 Let X be an elementary finite maximal bifix code of degree d on the alphabet 
A = {a,b}. Let a = (1,2,...,d), B = b *a a, y = b xa b with the usual convention 
i-a=i+1fo 1<i<d-—1. Show that 8 and y are such that 8 = (ii --- ip) with 
iy <... < ip and y =a’ where 7 is a product of cycles of the form (k,k+1,...,4+m) 
with kG > k+ m. Show that conversely, any choice of @ and y satisfying the above 
conditions defines a finite code. 


6.2 Show that for A = { b}, there are exactly six elementary finite maximal bifix 
codes over A (see Exercise ‘with group equivalent to PGL2(5) 


5.3 
6.3 Show that the automaton in Table Ea defines a finite maximal bifix code X of 
degree 7. Show that G(X) is equivalent to the group GL3(2) of invertible 3 x 3 matrices 
with elements in Z/2Z , considered as a permutation group acting on (Z/2Z)* — 0. 


5.4 
6.4 Show that the automaton in Table Brq defines a finite maximal bifix code X of 
degree 11 . Show that G(X) is equivalent to the Mathieu group M11 . 


9 Notes 
6.7 
Propositions [dis due to Perro (979) 


The theorem of the synchronization of somiaphen codes (Theorem 2 ESE is in|Schutzenberger 


(1964). Thi pey contains also a difficult aie proof of this est 

heorem 3.1 already appears in SETTE (956). Theorem pets Corol AY oe 
ses are from fet ro Reis and Thierrini The ergodic sists of Section i 15 
described in (L979). It is a in (L981) to describe a 


construction of finite maximal bifix codes. Theorem 6.5 is a combination of a theorem 
of Schur and of a theorem of Burnside. Schur’s theorem is the following: “Let G be a 
primitive permutation group of degree d. If G contains a d-cycle and if d is not a prime 
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number, then G is doubly transitive.” This result is proved in (11964), pp. 
52-66. It is the final development of what H. Wielandt calls the “Method of Schur”. 


Burnside’s theorem is the following: “A transitive permutation group of prime degree 
is either doubly transitive or a Frobenius group.” Burnside’s proof uses the theory of 
characters. It is reproduced in (1967), p. 609. An elementary proof (i.e., 
without characters) is in (E982), Vol. III, pp. 425-434. 

The other results of this chapter are from (L975), (L977), (L978). 
(L975) giyes a more exhaustive catalogue of- eyamples than the list of Section 5. 
aes 2.2) is from (1983). Exercise Fat to Sa Exercise 

is a well-known propert of ide iņite” automata (Perles et al (1963). eeienece 

The exercises of Section peso (1978) and those of Section fare trom ee ee aaa 
(1975). The definition of the Mathieu group M11 used in the solution of Exerci mms 
is from (19711). It is a sharply 4-transitive group of order 11 x 10 x 9 x 8. The 
set H is known as the ternary Golay code. 


Excedences of permutations are @ wejl-known notion in combinatorics (see 
((1983))). The result of Exercise FP ae been improved by (£991). He proved 
that if G is a permutation group of degree d and minimal degree n — k, containing the 
cycle a = (12 --- n), for any d such that n > 2kd — k every permutation in G — (a) has 
at most n — d — 1 excedences. he also s ows phat the bound is the best possible. His 
result implies the statement of Exercise 2 taking d = 2k + 1. 
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Chapter IX 


Factorizations of cyclic groups 


0 Introduction 


We describe the links between codes and factorizations of cyclic groups. It happens that 
for any finite maximal code X one can associate with each letter a several factorizations 
of the cyclic group Z/nZ where n is the integer such that a” is in the code X. These 
factorizations play a role in several places in the theory of codes and appear several 
times in the book Theory of Codes. However, they are not presented systematically, as 
we do here. 

W begin. with an introduction to the notion of factorizations of cyclic groups (Sec- 

Kf We th 


tion I). en study how factorizations arise in connection with two special kinds 
i c-bayonets j chooks . SRR 
of words: bayonets (Section ži and hooks (Section B). We will see that factorizations 


of cyclic groups give insight into several properties of codes, like being synchronous or 
being finitely completable. 


1 Factorizations of cyclic groups 


Let G be a group written additively. Given two subsets L, R of G, we write L + R = 
{+r|leL,re R}. The sum L + R is direct if for any element g in G, there exists 
at most one pair (f,r) with £ € L and r € R such that g = £+ r. This means that for 
finite sets L, R, the sum is direct if and only if Card(L + R) = Card(L) Card(R). The 
pair (L, R) is called a factorization if G = L + R and the sum is direct. We also say 
that G= L+ R is a factorization of G. 


EXAMPLE 1.1 The pair (L, R) defined by L = {0,5} and R = {0, 2, 4} is a factorization 
of Z/6Z. More generally, if R is a subgroup of Z/nZ and L is a complete set of residues 
of R, then (L, R) is a factorization. 


The following example illustrates how the coset decomposition may be iterated to form 
more complex factorizations. 


EXAMPLE 1.2 The pair (L, R) defined by L = {0,4,8,9,13,17} and R = {0,3,6} isa 
factorization of Z/18Z. We have actually L = {0,9} + {0,4,8}. Thus {0,4,8} + R is 
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a system of reprentatives of the residues modulo 9. Accordingly, Z/9Z = {0,4,8} + 
{0,3,6} is a factorization. 


EXAMPLE 1.3 Let p,q be positive integers and let L = {0,1} and R = {0,p,q} with 
p<q. The sum L + R is direct in Z if and only ifl1<p<q-1. 


In the sequel, we shall be interested in factorizations of Abelian and, more specifically 
of cyclic groups. Let G = Z/nZ, let L,R be two subsets of G and let U,V C Z be 
sets of representatives of L, R. Then G = L + R is a factorization if and only if for 
each integer k there exists a unique pair i,j with i € U and j € V such that k=i+j 
mod n. 


EXAMPLE 1.4 Let L = {0,3,8,11} and R = {0,1,7, 13,14}. Since the numbers + r 
are all distinct, the sum L + R is direct in Z or in Z/nZ for large enough n. The pair 
(L, R) is not a factorization of Z/20Z because 8+ 13 =0+1=1 mod 20 and so the 
: : : et 
sum is not direct. It is not known whether there exists an integer n and sets L’, R such. or? 
that Z/nZ = L’ + R' is a factorization with R C R’ and L C L’. See also Example Bost 


The following statement gives a useful method to handle factorizations. 


PROPOSITION 1.1 Let G = L+ R be a factorization of a finite Abelian group G. For 


any integer q E€ Z prime to Card(L), G = qL + R is a factorization. 


Proof We may assume that 0 € L, since otherwise we replace L by L’ = L — £ for some 
LE L. If G= qL’ +R is a factorization, then so is (¢qL’+ q@)+R=qLl+R. 

Consider first the case where q = —1. In this case, we clearly have Card(qL) = 
Card(L) and we only need to prove that the sum G = (—L) + R is direct. Suppose 
that —l+r = —l' +r’ with £, V € L and r,r’ € R. Then ¥# +r = +r and thus r = r’, 
l = l’. This proves the result in this case. 

Suppose next that q > 1 is prime. For g = l+ r with 4 € L and r € R, we denote 
A(g) = £ and p(g) =r. 

As a first step, let us prove that for any g € G, the map £> A(g+ £) is a permutation 
of L. For this, let ,é’ € L and assume A(g + 2) = A(g +’). Set g+@=utv and 
gtfl=ut+v' with u€ L and v,v' € R. Then v—¢=v' — ¢ and thus ¢ = V since we 
have just shown that R — L is a factorization. 

We claim that for g € G, there is an x € L such that A(g) = —qz. To prove this claim, 
consider the set T of g-tuples (x1,..., £q) of elements in L such that \(g+21+---+%,) = 
0. For each choice of 21,...,%g-1 in L the map £ > A(gt+a1+-:-+%1+24) isa 
permutation of L. Thus there is a unique x, € L such that (x1,..., £4) € T. Thus T has 
Card(L)?~! elements. Since q is prime, and q does not divide Card(L) we obtain that 
Card(T) = Card(L)?-! = 1 mod q. The set T contains all cyclic shifts of its elements. 
Since q is prime, the number of distinct cyclic shifts of an element of T is either q or 1. 
Since Card(T) = 1 mod q there is at least one t € T such that all its cyclic shifts are 
equal, that is such that t = (a,z,...,2) for some x € L. Since A(g + qx) = 0, we have 
g + qu = p(g + qx) and thus g = —qr + p(g + qx). 

This shows that G = (—qL) + R. Since Card(—qL) < Card(L), the sum is direct 
and thus (—qL, R) is a factorization. By what we have seen above, this implies that 
G = qL + R is also a factorization. 
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Finally, when q > 1 is prime to Card(L), we write q as a product of primes and apply 
iteratively the above argument. n 


EXAMPLE 1.5 Starting with the, factorization L = {0,4,8,9,13,17} and R = {0,3,6} 
of Z/18Z given in Example i we obtain for q = 5 the new factorization given by 
5L = {0,2,4,9,7,13} and R. 


A subset H of a group G is said to be periodic if there is an element g € G \ {e} such 
that g + H = H. We refer to such elements g as periods of H. 
A factorization (L, R) of a group G is called periodic if L or R is periodic. 


EXAMPLE 1.6 The pair (M, S) defined by the two sets M = {0,4,8,9,13,17} and 
S = {0,3,6, 18, 21,24} is a periodic factorization of Z/36Z. Indeed, 18 is a period of 


the set S. 


A group G is said to have the Hajós property if any factorization of G is periodic. 
The integer n is said to be a Hajós number if the group Z/n hag the Hajos property. If 
n is a Hajós number, then any divisor of n is (see Exercise ; a he following example 
shows that 72 is not a Hajós number. 


EXAMPLE 1.7 The pair (L, R) defined by L = {0,8, 16, 18, 26,34} and R = {0, 1, 5, 6, 12, 25, 29, 36, 42, 48, 49,5 
is a factorization of Z/72Z which is not periodic. 


Table 9.1: A non periodic factorization of Z/72Z. 


One may verify that it is indeed a factorization by inspection of Table ri synch 
R is the first row, L the first column and each entry the sum of the elements in 
first row and column (the elements appearing in boldface are those for which the sum 
exceeds 72). Alternatively, we may proceed as follows. Let Ro = {0,6, 12,36, 42, 48} 
and Ry = {1,5,25,29,49,53} be the sets of even and odd elements of R. Let M = 
{0,4,8,9, 13,17}, S = {0,3,6, 18,21, 24} and T = {0, 2, 12, 14,24,26}. Then L = 2M, 
Ro = 2S and Rı = 2T +1. The pairs (M,S) and (M,T) are periodic factorizations 
of Z/36Z (actually, (M, S) is the factorization of Example . Then L+ R= 2M + 
(2S U (27 + 1)) = 2(M + S$) U (2(M@ +T) + 1) and thus (L, R) is a factorization. 


See the Notes for a characterization of the Hajós integers. 
A group G is said to have the Rédei property if for any factorization G = L + R, 
either (L) # G or (R) # G. (We denote by (H) the subgroup of G generated by H.) 


An integer n is called a Rédei number if the group Z/nZ has the Rédei papas Pen 


It can be shown that a Hajós number is a Rédei number (see Exercise 
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eee 
EXAMPLE 1.8 Let Z/72Z = L + R be the factorization of example EJ “Since all 
elements of L are even, the group (L) is contained in the subgroup of index 2 formed 
by the even residues modulo 72. Actually, 72 is a Rédei number (see the Notes section). 


The following example shows that 900 is not a Rédei number. 


EXAM 22qbet n = 900 and let L, H, R be the subsets of G = Z/900Z listed in 
Table ba We will show that G = L + R is a factorization and that (L) = (R) = G. 
Let zı = 225, x2 = 100 and z3 = 36, which are elements of G of order 4, 9 and 25 


{720} 
870 

120 

90 {270} 
[240] [420] 
390 570 


{45} 
870 


120 
630 90 {495} 
[816] [276] [456] 

390 570 


Table 9.2: The sets L, H and R. 


respectively. The orders of £1, £2, £3 are pairwise relatively prime with a product equal 
to 900. Thus G = (x1) + (x2) + (x3). 

Let Li = {0, z1}, Lo = {0, £2, 2£2}, L3 = {0, £3,... , 423} and Hı = (204); Hə = 
(3x2), H3 = (523). We have 


L = Lı + Lə + L3, H = A, + Hə + H3. 


Indeed, the first row of the array giving L in Table Bis Ts the first three rows form 
L3 + Lə and the last three rows form L3 + Lə + z1. The first row of the second array 
is H3, the rows 1,3 and 5 form Hə + H3 and the other ones are obtained by adding 

Clearly, G = L + H is a factorization. We now modify the set H as follows to 
obtain the set R in such a way that 271,272,73 € (R). We first add zə = 100 to 
each element of Hz + 22, (the corresponding elements are marked by ( ) in H and 
R). In this way, the set H’ obtained is still such that G = L + H’. Indeed, we have 
L + Hə + 24, + z2 = Lı + L3 + (x2) + 241 = L + Hə + 221. In a second step, we add 
x3 = 36 to each element of H3 + 6x2 (the corresponding elements are marked [| |). The 
set H” obtained still satisfies G = L + H” for a similar reason as previously. Finally, 
the set R is obtained by adding xı = 225 to each element of Hı + 20x3 (the elements 
are marked with { }). 
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The factorization G = L + R is such that (L) = G and (R) = G. The first equality 
follows from the fact that 71,272,173 E€ L. The second one can be verified as follows. 
Since 523,322 are in R (they already belong to H and have not been modified), we 
have 20x3,6x2 € (R). Since, by construction of R, 20£z3 + zı € R, we have x, € (R). 
Similarly, since 6r2 + z3 € R, we have x3 € (R). Finally, since 27; + x2 is in R by 
construction, we have also x2 € (R). Thus 21, 22,273 E€ (R) and (R) = G. 


2 Bayonets 


In this section, we will see that, under appropriate hypotheses, given a code X c At 
and a letter a € A, the integers i, j such that awa € X* for a € A and w € A* give 
rise to some factorizations of cyclic groups. We begin with the case of w =b € A. A 
bayonet is a word of the form a‘ba” for a,b € A. We begin with the following easy 
result. We say that a pair (L, R) of sets of integers is direct modulo n if €+r =V +r 
mod n, with 0,@’ € L, r,r’ € R implies £ = @’ and r = r’. In other words, (L, R) is 
direct if for any integer m there is at most one pair (¢,r) € L x R such that m= @+4+r 
mod n. This is equivalent to saying that (L, R) is direct modulo n if and only if the 
sum L + R formed with the sets of residues modulo n of L, R is direct. 

Observe that if (L, R) is direct modulo n and L,R are both nonempty, then the 
elements of L (and of R) are distinct representatives of classes of integers modulo n. 

Given a word w and a subset H of N, we write w” for the set {w” | h € H}. 


PROPOSITION 2.1 For L,R CN and n > 1, the set X = a” U aba is a code on the 
alphabet A = {a,b} if and only if (L, R) is direct modulo n. Moreover, the code X is 
maximal if and only if L+ R = {0,...,n— 1}. 


Proof If (L,R) is direct modulo n, then X is a code. Consider indeed a word w 
in X*. We prove that w has a unique decomposition into words in X. Set w = 
a™ba™tb...ba™k for nonnegative integers mo,..., Mp. If k = 0, the word w is a 
unique power of a”. So assume k > 1. For each 7 with 0 < i < k there is a unique pair 
(ri, li+1) € R x L such that m; = ri + li+1 mod n. Moreover, there is a unique 44 € L 
and a unique rg E€ R such that 41 = mo mod n and rk = mą mod n. Thus the unique 
factorization of w is of the form w = yor1y1 °°: £kYyk With x; = aiba", and y; € (a”)*. 

Conversely, assume that X is a code. In order to show that (L, R) is direct modulo 
n, let £, V € L, r,r’ € R such that +r =V} +r’ mod n. There exist an integer k such 
that +r = l +r’ + kn. By symmetry, we may assume k > 0. Then (a‘ba")(a‘ba") 
and (a‘ba™ )(a”)*(a“ ba") are two factorizations of the word a‘ba"tba". Since X is a 
code, this implies k = 0, £ = @’ and r =r’. 

Finally, let m be a Bernoulli distribution over A and set 7(a) = p. Then m(X) = 
pe (l= P)O rer? Erer): Thus 7(X) = 1 if and only if Deer rer P = 
1+p+---+p"!, and this holds if and only if L + R = {0,...,n—1}. n 


The pairs (L, R) such that (L, R) is direct modulo n and L + R = {0,...,n — 1} are 
precisely the pairs such that every integer in {0,...,n — 1} has exactly one decom- 
position of the form £ +r with L € L,r € R. These pairs define particularly simple 
factorizations which are described in Exercise [I.2} 
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EXAMPLE 2.1 For n = 6, the pair composed of L = {0,1} and R = {0,3,5} is direct 
modulo n. The set X = a” Uatba® is {a®, b, ab, ba, aba?, ba®, aba}. 


If X is an arbitrary finite maximal code on A = {a,b}, the set of bayonets contained 
in X may not have the form above since the set of pairs (£, r) such that afba” € X for 
some a,b € A needs not even to be a cartesian product. 

Let X be a code and a be a letter such that a” € X for some integer n > 1. For 
a word w, we denote by C(w) the pairs of residues modulo n of integers 7,7 > 0 such 
that awa € X*. Recall that, given a finite maximal code X, the order of a letter a is 
the integer n > 1 such that a” € X. The order exists for each letter. 

We start with a useful observation. 


LEMMA 2.2 Let X be a finite maximal code over A, and leta € A be a letter. For any 
w € A*, one has a*wa* N X* Æ 0. 


Proof. Since X is finite and maximal, it is complete. Let Z be the maximal length of a 
word in X. The word a‘wa* is completable, thus ua’wa‘v € X* for some words u,v. By 
the definition of £, there exist integers i, i’, j, 7’ such that ua” awat, afv € X*. n 


PROPOSITION 2.3 Let X be a finite maximal code on the alphabet A. Leta € A be a 
letter and let n be the order of a. For each word w € A*, the set C(w) has exactly n 
elements. 


Proof Let £ be the maximal length of the words of X and let kn > 24. For each r with 
0 <r <n, we show that there is a bijection from the set C(wa’t*"w) onto the set of 
pairs of elements in C(w) of the form (i, p), (q, j) with p+ q = r modulo n. 

In a first step, we show that for each (2,7) € C(wa’+*"w) there is a well defined pair 
(p,q) of residues modulo n such that (7, p), (7,7) € C(w) and p+ q=". 

Indeed, consider a pair (i, j) of representatives of (7,7) € C(wa’+*"w). Then awat wad € 
X*. By the choice of k, there exist integers p,q such that awa? alwaf € X* and 
p+q=rt+kn. 

Observe that if p',q' are such that awa?” atwa € X* and p' + q = r + kn, then 
assuming for instance p’ > p, one has a?’-P € X* since X* is stable. Thus p= p 
mod n and also q = q! mod n. Consequently, the pair (p,q) is well defined by the pair 
(i, j). 

Next, if i’ =i mod n and j' =j mod n and let (p’,q’) be the pair corresponding to 
(i', 9’). If for instance i’ > i then a’ wa? = a’ ~‘a'wa? is in X* and consequently p' = P. 
This defines a mapping (2,7) > (7, p),(7,q) with p+ q =F. 

This mapping is clearly injective. We prove that it is surjective. Indeed, consider a 
pair awa? awa € X* with p+q=7. If p > £, then a’wa?—”" € X*. Thus we may 
assume p < £ and also q < 4. There is an integer t such that p + q + tn = r + kn, 
and actually t > 0 because tn = r+ kn—p-—q > r+kn-—2 >r > 0. Thus 
(a'wa? )a'n(atwa?) = awa" tt” wal is in X* and 7) is in C(wa™t*"w). te 

Let c(w) = Card(C(w)). By Lemma EA we have c(w) > 0. From the bijection, it 
follows that 


n-1 


c(w)? = ‘> c(wa tw). 


r=0 
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Now we prove that c(w) =n for all w € A*. Recall that 0 < c(w) < n?. Let w be such 
that c(w) is minimal. Since Da c(wa"trw) > ne(w), we obtain c(w)? > ne(w) and 
thus c(w) > n. Next, let w be such that c(w) is maximal. We have Da; c(wa" tw) < 
nc(w) and thus c(w) < n. 


EXAMPLE 2.2 Let X = {aa, ba, baa, bb, bba}. There are four distinct sets C(w) with 
respect to the letter a, namely C(a) = {(0, 1), (1,0)}, C(a”) = {(0,0), (1,1)}, C(b) = 
{(0,0), (0, 1)} and C(ab) = {(1, 0), (1, 1}. 


THEOREM 2.4 Let X be a finite maximal code. Let y : A* — M be the morphism from 
A* onto the syntactic monoid of X* and let K be the minimal ideal of M. Let a be a 
letter and let n be its order. For u,v € A*, let 


R(u) = {i > 0 | uate X*} Liv) = {j > 0 | avA*n X* #0} 


and let R(u), L(v) denote the sets of residues mod n of R(u), L(v). If u,v € THK) 
and u is right completable in X*, then Z/nZ = R(u)+L(v) is a factorization. Moreover, 
Card(L(v)) is a multiple of the degree of X. 


Recall that a word u € A* is called right completable in X* if there is a word w 
such that uw € X*. A word u € A* is called strongly right completable (with respect 
to some code X) if any word in uA* is right completable in X*. A word u is called 
simplifying if for any x € X* and v € A*, x, xuv € X* implies uv € X*. Clearly, the 
sets of strongly right completable and of simplifying words both are right ideals. 


PROPOSITION 2.5 Let X C At be a thin maximal code. Let py : A* — M be the 
morphism onto the syntactic monoid of X* and let K be the minimal ideal of M. 
Then any right completable word u € p7!(K) is both strongly right completable and 
simplifying. 


Proof To show that u is strongly right completable, observe that the right ideal y(u) M 
is minimal and consequently, for every m = (v) € M there exists m’ = p(w) such 
that y(u)mm’ = y(uvw) = y(u). Since u is right completable, this shows that uvw is 
right completable. Thus u is strongly right completable. 

To show that u is simplifying, suppose first that u € X*. Let x € X* and v € A* be 
such that xuv € X*. Let m = y(u), p = y(x) and q = y(v). Then mpm belongs to the 
same group G as m. Let n be the inverse of mpm in G. Note that, since G is a finite 
group, n is a power of mpm and thus n € y(X*). We have mpmn = nmpm = e where 
e is the idempotent of G and thus m(nmpm)q = meq = mq. Hence mq = mnmpmq = 
(m)(n)(m)(pmgq) is in y(X*), and uv € X*. This shows that u is simplifying in this 
case. 

In the general case, since u is right completable, uA* N X* #4 Ø. Let y € uA* N X*. 
Then y(y) € K, thus the word y is simplifying by the preceding proof. Since the right 
ideal y(u)M is minimal, there exists v € A* such that y(yv) = y(u). To show that u is 
simplifying, consider x € X* and t € A* such that xut € X*. Since y(yv) = y(u), one 
has xyvt € X*, and since y is simplifying, one gets yvt € X*. Since y(ut) = y(yvt), 
this in turn shows that ut € X*. Thus u is simplifying. m 
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Proof of Theorem bra Consider an integer r > 0 and let k be such that kn is larger 
than the maximum of the lengths of the words of X. 

Since u is strongly right completable, there is a word w such that ua™t*"vw e€ X*. 
By the hypothesis on k, there exist i, j with r + kn = i + j such that ua’,a/vw € X*. 
By definition i € R(u), j € L(v). This shows that R(u) + L(v) = Z/nZ. 

Let us now show that the sum is direct. Let i,i’ € R(u) and j, j° € L(v) be such 


Figure 9.1: Proving that the sum is direct. 


that i +j = i +j) modn. We may assume that i+ j < +’. Le E 2 0 be 
such that i+ j + kn = i' +7’. Thus uattitry = ua”tiv (see Figure ae 
j! € L(v), there is a word w such that af vw € X*. Since. į L(V), the word afv is 
right completable and thus simplifying by Proposition 2.5. We have ua’t*” € X* and 
ua'tkrajyw = (ua®)(at vw) € X*. Thus alow € X*. 

Since uat, ak" +3 yw, ua” , af'vw € X* and X* is stable, we have, assuming for instance 
that i! >i, a’ ~i € X*. Thus i=?! mod n and also j = j’ mod n. 

Finally, for w € A*, let 


S(w) = {j >0| awe X*} (9.2.1) [eqs 


and let S(w) denote the set of residues of the elements of S(w). Let e = y(x) be an 
idempotent in K N y(X*). Let G = eMe be the group containing e and H be the 
subgroup GNM y(X*). Let G = u4 H gi be the decomposition of G into right cosets of 
H and let w; € y1(g;") for each i=1,...,d. 

We claim that L(v) = U2} S(vw;) and moreover the sets S(vw;) are disjoint and 
nonempty. First, consider j € S(vw;). By definition, aJuw; € X* and thus j € L(v). 
Moreover, we have also ey(alv)g;* € H and thus ey(av)e € Hg;, showing that the 
index 7 is uniquely determined by j. 

Conversely, let j € L(v). Then since ey(a’v)e € G, there is an index i such that 
ev(ajuje € H di pish implies ey(aJuw;) € X*. The word afv is simplifying by 
Proposition Peg Hence ovu; € X*, showing that j € S(vw;). 

Let 

N(u) = {i > 0 | A*uat N X* #0} (9.2.2) [eqn 


and let N(u) denote the set of residues modulo n of the elements of N(u). There is, 
by symmetry, an analogue factorization Z/nZ = N (u) + S(x") for each u’,v’ € yp !(K) 
with v’ left completable. Since for each w;,i = 1,...,d, the word vw; is left completable, 
one gets d factorizations Z/nZ = N(u)+S$(vw;). In particular all sets $(vw;) have the 
same number s of elements. Thus Card(L(v)) = SX“, Card(S(vw;)) = ds is a multiple 
of d. n 
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Evidently, there is a symmetric statement ,usjng left completable words, using the 
sets N(u) and S(v) defined by and a namely: if u,v € y'(K) and v 
is left completable, then Z/nZ = N(u) + S(v) is a factorization and Card(N(u)) is a 
multiple of d. 

The previous theorem has a close connection with Theorem IX.1.3 and the factor- 
ization of the polynomial 1 — X for a finite maximal code X. Actually, according to 
Lemma IX.8.3.3, there are polynomials P, Q, R with coefficients 0,1 such that 


A* = PX*Q+R. 
Taking b = 0 for all letters b Æ a, we obtain 
a* = U(a")*V+W 


for some polynomials U,V,W C a* with coefficients 0,1. Multiplying both sides by 
a” — 1, we obtain 
1+a+... +a! SUV Wie 1) 


or 
UV =1+a+... +a"! mod (a”-—1) 


which is equivalent to U = a’, V = a® with (L, R) a factorization of Z/nZ. 
We illustrate this statement in the following example. 


EXAMPLE 2.3 Let A = {a,b} and let X = (A? \ a?) U a3 A? which is a finite maximal 
prefix code of degree 3 (the lengths of the words of X are 1 ee 3). The transitions 


of the minimal automaton of X* are represented on Table etu =v =b. The sets 
0 1 2 3 4 5 
a|l 2 3 4 50 
b6|4 5 0 450 


Table 9.3: The minimal automaton of X* 


S(b) = {j > 0 | afb € X*} and N(b) = {i > 0 | A*ba’ N X* Æ Ø} satisfy S(b) = {2,5} 
and ÑN (b) = {0,1,2}, giving a factorization of Z/6Z such that Card( N (b)) = 3. 


torGene 
Theorem = takes a simpler form when X is synchronous. We give here a direct 
proof, but the proposition follows from the theorem in the case where the words 2, y 
are taken in the inverse image of the minimal ideal of the syntactic monoid. 


PROPOSITION 2.6 Let X be a finite maximal synchronous code. Leta € A and letn > 1 
be its order. Let x,y E€ X* be a synchronizing pair. Let R(y) = {r > 0 | ya” € X*} and 
L(x) = {L > 0 | afz € X*}. Let L, R be the set of residues modulo n of the sets L(x), 
R(y). Then (L, R) is a factorization of Z/nZ. 


Proof. Recall that yA*x C X*. Let w = ya”x with u greater than the maximal length 
of the words in X. Then there is a pair r,@ of integers such that ya”,afx € X* and 
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u =r +£. This proves that N = R(y) + L(x), and consequently that ng =L +R. 
The fact that the sum is direct is proved as in the proof of Proposition B1. n 


We illustrate the proposition in the example below. 


EXAMPLE 2.4 Let A = {a,b}. Consider the maximal prefix code X = (A? \ b?) Ub? A 
and the maximal suffix code Y = A?aUb. Then X* N Y* is generated by a finite 
maximal code Z which satisfies 


Z—-1=(1+a+b+b((A-—1ja(A-—1)+A-1)(1+a+a?+ ba). 


(see Exercise VIII.1.5). We have af € Z. The word x = ab?a is synchronizing for X 
and the word y = b? is synchronizing for Y. Thus we have yA*x C yA* N A*r C Z*. 
We have L(x) = {2,5}, R(y) = {1,3, 5}. 

We obtain the factorization ({0, 3}, {0,2,4}) of Z/6Z, in which both factors are peri- 
odic. In fact, L = {0,3} has period 3 since a? € Y and R = {0,2,4} has period 2 since 
ae xX. 


torGene 
A consequence of Theorem 2 is the following statement (appearing as Theorem VI.3.6 
with a proof using probability distributions and which can also be obtained as a con- 
sequence of Theorem IX.1.3). 


PROPOSITION 2.7 Let X be a finite maximal code on the alphabet A. The degree of X 
divides the gcd of the orders of the letters. 


torGene 
Proof. Let a be a letter and let n be its order. According to Theorem a there exists 


a factorization Z/nZ = R + L where Card(L) is a multiple of the degree d of X. Since 
Card(L) divides n, d divides n and the result follows. n 


In particular if the gcd of the orders of the letters is 1, then the code X is synchronous. 


3 Hooks 


A hook is a word of the form atb for some letters a,b and integers i,j > 0. In this 
section, we will show that, under adequate hypotheses, the hooks contained in a finite 
maximal code define factorizations of the cyclic groups Z/nZ where n is the order of 
some letter. 


THEOREM 3.1 Let X be a finite maximal code on the alphabet A and let a,b € A be 
such thatb € X. Letn > 1 be the order of a, and let 


L={£>0| abt nx #0}, R={r>0|bta™ nx £9}. 


Let L, R denote the sets of residues modulo n of L, R. Then (L, R) is a factorization 
of Z/nZ. 
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Proof Let k > 1 be larger than the length of the words of X. Then, since b € X, we have 
bk A*bk C X*. Thus, for any i > 0, the word w = bka‘tk"d* is in X*. This implies that 
there exist integers p,q,r, such that w € b*(bPa‘)(a”)*(a"b1)b* with bPa, abt € X. 
This shows that i =r+£ mod n. 

The decomposition of 7 is unique. Suppose indeed that r+ @=r’ + ¢ + tn for some 
integer t (with t > 0, the other case is symmetric) with r,r’ € R and £, V € L. Let p',q' 
be such that P'a”, ab! € X*. Then the word b¥a+"b* has the two factorizations 


DEP (BP a’) (a b2)bE-4 = EP (BP a” Jal” (a bET, 


Since X is a code, these factorizations are the same, and p = p’, 4 = ¥', r = r’ and 


q=. , 


EXAMPLE 3.1 Let X = {aaaa, ab, abaa, b, baa}. Then n = 4 and 
L= {0, 1}, R= {0, 2} 


=RSS torGene -factorGene 
It is possible to obtain egrgm 2 i as a corollary of Theorem Pi {see Exercise E i 


One may use Theorem O prove that some codes are not contained in a finite 
maximal one. 


PROPOSITION 3.2 Let L,R CN withO € R,L and n> 1 be such that the pair (L, R) 
is direct modulo n and Card(L),Card(R) > 2. If n is a prime number, then X = 
a” UařbU ba? is a code which is not contained in a finite maximal code. 


Proof The fact that X is a code follows f ny deroposition | Beret Y be a finite maximal 
code containing X. Then, by Theorem e sets R, L of residues modulo n of R, L 
are contained in sets R’, E which form a oriei of Z/nZ. Since (L, R) is direct, in 
particular Card(R) = Card(R) and Card(L) = Card(L) Thus n = Card(R’) Card(L’) 


is nontrivial factorization of n, a contradiction. n 


2-nonfcompletable| EXAMPLE 3.2 The set X = {ař, b, ab, ba?} is a code which is not contained in a finite 


maximal code. 


EXAMPLE 3.3 Let X = ba™ Ual?®}ba®2 U al ba®s with Rı = {0,1,7,13,14}, Ro = 
{0, 2,4, 6}, R3 = {0,1,2}. The set X is an example of a code which is not commutatively 
prefix (see Example VII.6.4). 

It is not k powhether X is contained in a finite maximal code. If it is the case, 
by Theorem E exists an integer n and sets R,L such that Z/nZ = R + 
is a factorization with {0,3,8,11} C L and {0,1,7,13,14} C R (see Example 
This implies that n is not a Rédei number since 0,1 € R and 0,3,8 € L and thus 
(L) = (R) = Z/nZ. =replacement 

It is easy to see, using Proposition E that soch an integer n is a multiple of 330 = 
2x3x5x11. Indeed, if n were not divisible by 3, then R+3L would be a factorization, 
a contradiction with the fact that 3 is in R and in 3L. The same argument shows that n 
is divisible by 2 and 11. Finally, if n is not divisible by 5, then L+5R is a factorization, 
a contradiction with the fact that 8 = 3 + 5 = 8 + 0 has two decompositions. 


= hor 
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A factorization L,R of Z/nZ is a Sands factorization if there exist two relatively 
prime integers p,q which are not multiples of n such that 0,1 are in one of the factors 
L or R and 0,p,q are in the other factor. The hypothetical factorization discussed in 
the previous example would be a Sands factorization. 

The following example shows that there exists a Sands factorization where, in addi- 
tion, p is prime. 


. : N snotRedei 
EXAMPLE 3.4 We start with the factorizat on Get of Example i J where n = 900 
ba Since 36 


and the sets L and R are given in Table 1 is an element of L prime to 900, 
thus invertible modulo 900. It is easily checked that @ = 541 is its inverse. Since £ is oe ener 
prime to 30, setting U = 4L, G = U + R is still a factorization by Proposition fees 
and 0,1 € U. It remains to replace R by an appropriate factor. For this, consider the 
elements r = 45 and s = 96 of R. In the factorization G = U + R, the factor R can 
be replaced by R — r to get the factorization G = U + (R — r), and 0 € R — r. Next 
96 — r = 51 = 3 x 17 is in R— r. Since 17 is relatively prime to 900, it is invertible 
and its inverse is 53. Since . = 53 is relatively prime to Card(R — r) = 30, in the 
factorization G = Us Ror) r), we may replace the factor R — r by “i — r) again by 
Proposition e in th the factorization a U +V with V = m(R — r) which 


satisfy the conditions with p = 3 = m(96 — r) 900 and q = 65 = m(250 — r) 
mod 900. The sets U,V are represented on N 1s factorization is a Sands 


Table 9.4: The sets U and V with 0,1 € U and 0,3,65 € V. 


factorization since 0,1 € U and 0,p,q € V with p = 3 and q = 65. 


A multiple factorization is defined as follows. For an integer d > 1, a d-factorization 
of a group G is a pair (L, R) of subsets of G such that each g € G can be written in 
d different ways g = €+r with £ € L andr € R. Thus an ordinary factorization is a 
1-factorization. 

The concept of multiple factorization can be extended to the case of multisets (L, R). 
We say that (L, R) is an m-factorization of Z/nZ if each element of Z/nZ can be written 
in m different ways as the sum modulo n of an element of L and an element of R, with 
the multiplicity taken into account. 

For example, L = {0,0,1,5}, = 40; 2,4} forms a 2-factorization of Z/6. 

A generalization of Theorem 1s the following. 


PROPOSITION 3.3 Let X be a finite maximal code on the alphabet A. Let a,b € A and 
let n,m > 1 be the integers such that a",b™ € X. Let R,L be the multisets 


L={@>0|a°tt'n XAG}, R={r2>0| bra" NX 4H}. 
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Let L,R be the multisets of residues modulo n of L,R. Then the pair (L,R) is an 
m-factorization of Z/nZ. 


PS 
Proof We use Proposition Ey Let k be the maximal length of the words of X. Let 
s > k. By Proposition 2.3, there are m pairs of residues modulo m of integers i,j > 0 
such that b’a°b! € X*. Thus s is the sum in m ways of integers r, / such that bta”, abi € 
XT 7 


EXAMPLE 3.5 Let X = {aa, ba, baa, bb, bba}. Then n = m = 2 and L = {0}, R= 
{0,1,1,2}. The statement is satisfied since 0 and 1 are obtained each in two ways as 
the residue of an element of R. 


maximal code (see Exercise : 


torLam i : i 
One may use Proposition ni a aprove that some codes are not contained in a finite 


4 Exercises 


. section5bis.2 
Section 


1.1 Let L, R C {0,1,...,n — 1} and consider the polynomials in the variable a 


a= o, =y g 


LEL reR 


Show that if (L, R) is a factorization of Z/nZ, then a” — 1 divides a’a®(a — 1). 


1.2 Let n > 1 be an integer and let p(a),q(a) be two polynomials in the variable a 
such that 


Show that if p(a),q(a) have nonnegative coefficients, there exists a sequence no = 
1,n1,...,N41 =n of integers such that n; divides n;41 for 1 < i < k such that p,q are 
obtained by grouping in two parts the factors of the product 


a” —1 a” —1 a™ — 1 


a—1 ark a—1 ` 


1.3 Show that all divisors of a Hajós number, are also Hajós numbers. 


1.4 Prove that a Hajós number is a Rédei number. 


1.5 Show that if Z = L + R is a factorization of Z with L finite, then R is periodic. 
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g section5bis.3 
Section 


=RSS . torSynchro 
2.1 Deduce Theorem rom Proposition [2.0f 


2.2 Let m,n > 1, let (H, K) be an m-unambiguous pair such that m € H, K and let 
(S,T) be an n-unambiguous pair such that n € S,T. Show that 


X =a” Ub” Ure Ug =a =i" (9.4.1) 
is a code. 


2.3 Let d,t > j > 0 and let m = dt + j. Show that for any n > 1, when (S,T) is 
a Weee of Z/nZ and Card(H) = d, Card(K) = t, the code defined by Equa- 


tion is not contained in a finite maximal code. 


~Lam2 
2.4 Use Exercise Bato show that the code 
Y = {a° ba?, ba", b a7 a baba ab} 


is not contained in a finite maximal code. 


2.5 Show that if (L, R) is a factorization of Z/nZ where n is a Hajós number, then 
the code a” U a’ba® is composed of prefix and suffix codes. 


5 Notes 


Factorizations of cyclic groups or, more generally of Abelian groups, form a subject with 
a respectable history, beginning with the proof by G. Hajós in 1941 of a conjecture of 
Minkovski. The recent book by S. Szabó (2004) is recommended for an exposition 
of this subject. Two important results in this theory are the theorems of Hajós and 
Rédei. The first one asserts that if G = A, +... + An is a factorization of a finite 
Abelian group G where each A; is a cyclic subset, then at least one of the factors must 
be a subgroup of G (a cyclic subset is of the form 0,a,2a,...,ra for some a € G and 
r > 1). The second one is a generalization of Hajós theorem proved by L. Rédei 
(1965). The theorem says that if G = A,+...+ An is a factorization of a finite Abelian 
group G such that each A; has a prime number of elements and contains the neutral 
element, then at least one of the factors must be a subgroup of G. 
The link between codes and factorizations of cyclic groups was first noted by Schtitzenberger 


(19794). - ae ee 

Proposition ET e due to Sands (2000). Example (Cys a counterexample to 
a conjecture of Hajós due to De Bruijn (1953). The Hajós numbers are known 
exactly. An integer n is a Hajós number if and only if it is a divisor of one of the 
form p%q, p2q’, p?qr or pars with a > 1 and p,q,r distinct prime numbers (see Bzabd 


lex=anotRedei 


(2004)). The least integer n which is not a Hajós number is thus n = 72. Example [1.9 
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is due to Szabó (L985). The list of Rédei pene is also known exactly. It is 
formed of the divisors of integers of the a pigr, p° grs par st, where p,q,r,s,t are 
distinct primes and a,b > 1 Bzabd (2006). tase he = is due to A. Sands. It is a 
counterexa to a conjecture arene in 1] (L989). 


ras 


Exercise [isa reslt off 1937) 
Exercise is a result due to Hajós (see (2004)) p. 165 and also 


(1977). The optimal bound on the period of R is not known (see Bzabd (2004) for an 
example where the period is quadratic in the size of R). 


Proposition augsult from Perrin and Schiitzenberger P 
(1976). Theorem lags" from Restivo, y En and Sar fone i. 
while Proposition is due to Lam [Lan] (L996 Toi is from Lam I 

His result generalizes one of De Felice D L996) (L996) who proved the same result for 
a code X of the form X = a” U atb U ba”. For i smaller class De Felice also proved 
in (1996) that X is included in a finite maximal code with the additional 


property that for each word in X there are at most two occurrences of the letter b. 
The references to Theory of Codes are as, follo Propasition 1S pro osed a as 
-non Zom 
Exercise 6.3 of Chapter VIII. Example pe Example [.5.6. Proposition 15 sta ed 
for prefix codes as Theorem 6.4 of Chapter II in [Berstel and Perrin (L985). It is proved 


in Chapter VI in the general case using probability measures (Theorem VI.3.6). 
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Chapter X 


Densities 


0 Introduction 


We have used measures for studying codes as early as in the first chapter. In particular 
we have seen that for any code X over an alphabet A and for any Bernoulli distribution 
m over A*, we have 7(X) < 1. We also have seen that a thin code X is a maximal code 
iff 7(X) = 1. We thus have obtained an astonishingly simple method for testing the 
maximality of a code. This is a mark of the importance of the role played by probability 
measures in the theory of codes. 

In this chapter we again use measures. We will study asymptotic properties of certain 
sets related to a code. For this the notion of density of a subset L of A* is introduced. 
It is the limit in mean, provided it exists, of the probabilit t a word of length n 
is in L. We shall prove a fundamental formula (Theorem j that relates the density 
of the submonoid generated by a thin complete code to that of its sets of prefixes and 
suffixes. For this, and as a preparation, we shall see how one can compute the density 
of a set of words by transferring the problem to a study of probabilities in abstract 
monoids. 


1 Densities 
Let 7 be a positive Bernoulli distribution on A*. It is a morphism 
a: A* >]0, 1] 


such that 


`> ra) =. 
In the sequel, we use the notation 
A™ = {1}UAU---U AE, 
In particular AO = 9, AY = {1}. 
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Let L be a subset of A*. The set L is said to have a density with respect to m if the 
sequence of the m(L N A”) converges in mean, i.e., if 


n—1 
1 
lim — ) a(LM AF) 
noo n 
k=0 


exists. If this is the case, the density of L (relative to 7) denoted by 6(L), is this limit, 
which can also be written as 


6(L) = lim (1/n)a(LN A™) . 


n— oo 


An elementary result from analysis shows that if the sequence m(L N A”) has a limit, 
then its limit in mean also exists, and both are equal. This remark may sometimes 
simplify computations. Observe that 


5(A*) =1 


and 
0<<d(L) <1 


for any subset L of A* having a density. If L and M are subsets of A* having a density, 
then so has LU M, and 
(LU M) < 6(£L)+6(M) . 


If LAM = 0, and if two of the three sets L, M and LU M have a density, then the 
third one also has a density and 


(LU M) = 6(L)+6(M) . 


The function 6 is a partial function from §(A*) into [0,1]. Of course, 6({w}) = 0 for 
all w € A*. This shows that in general 


5(L) # X d({w}) . 


wel 


Observe that if 7(L) < co, then 6(L) = 0 since 
(LAA) < x(L) 
whence 
i 
=n(LN AM) 30. 
n 
EXAMPLE 1.1 Let L = (A?)* be the set of words of even length. Then 
mL AP) = (LA ACD) = k 


Thus 
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q=7(b). Then (see Example 


[ex6.1.2] EXAMPLE 1.2 Let D* = {w te dla = |w} over A = {a,b}, and set p = z(a), 


2 
a(D* N a” = ( "yp a(D* N ARTEN = 0 
n 
Using Stirling’s formula, we get 
1 
D* nN A?” a 4” n 
nm )~ "(0 


showing that for all values of p and q, 


lim m(D* N A”) = 0 


n—-CoO 
Thus 6(D*) = 0. 


The definition of density clearly depends only on the values of the numbers 7(LN A"). 
It appears to be useful to consider an analogous definition for formal power series: we 
restrict ourselves to formal power series in one variable with coefficients in R+. Let 


f=>0 ft” € Ry [fe] 
n>0 


be a formal series. The density of f, denoted by ô( f) is the limit in mean, provided it 


exists, of the sequence fn, 
n-1 


6(f) = lim — 2 


To each set L C A* we associate a series fzr € R4[ft]] by 


fr= So anae (10.1.1) 
n=0 


Clearly fz has a density iff L has a density, and 


6(L) = o(fr) - 


1 
Let fz be given by RTT: We denote by pz the radius of convergence of the series 
fi. It is the unique real number p € R+, such that 


Sob An)2" 


n>0 
converges for |z| < p and diverges for |z| > p. For any set L, we have py > 1 since 
m(LAA”)<1 


for all n > 0. a l 7 = 
The following proposition is a more precise formulation of Proposition test 
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PROPOSITION 1.1 Let L be a subset of A* and letm be a positive Bernoulli distribution. 


If L is thin, then pr, > 1 and 6(L) = 0. 


Proof. Let w be a word in F(L) and set n = |w|.Then we have, for 0 < i < n and k > 0, 


LiyvAl (Anca A= {wp . 


Hence 
mL A‘(A”)*) <(1- m(w))* : 


Thus for any p € R+ satisfying (1 — 7(w))p” < 1, we have 


n—-1 co 
file) <>) > 0d E e 
i=0 k=0 
= Sot Soca neon 
i=0 k=0 


This proves that 


1 1/n 
> | ————_ I; 
o> (Ty) j 


For later use, we need an elementary result concerning the convergence of certain 


series. For the sake of completeness we include the proof. 


PROPOSITION 1.2 Let f = p> fnt”, g = nso gnt” E R4 [ltl] be two power series 


satisfying 


(i) 0 < o = J n>0 9n < % 
(ii) for alln >0,0< fn <1. 
Then (f) exists iff (fg) exists and in this case, 


(fg) =d(fje . 


Proof. Set 


Then for n > 1, 


og 


l 

AAE TN 
Mi 
Sh 

eee 
Me 
Ss 


= 720 
n-1 lee) 
= 9 figi + X fi X Jj 
O<itj<n-1 = imne 


n—-1 


n-1 
= oe hk + F Sta 


(10.1.2) 
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where r; = yo gj. Let sn = Se fitn—i. Then for n > 1, 


n—-1 n-1 
(2 > s) c= (: z m) + = $n (10.1.3) 
1=0 k=0 


Furthermore 
n 


n—-1 n-1 
Sn = yas = eee = Y (10.1.4) 
i=0 i=0 


i=1 


Since ` gn converges, we have lim;_... ri = 0, showing that also 


n—1 
1 
lim => r; =0 
n=œ n 4 
i=1 


4 
and in view of RET, 


n>œ n 
3 
Since o Æ 0, Eq. RDE) shows that (f) exists iff 6(h) exists and that 6(f)o = d(h). 
This proves ( and the proposition. m 


PROPOSITION 1.3 Let be a positive Bernoulli distribution on A*. Let L, M be subsets 
of A* such that 

(i) 0< 7(M) < oo. 

(ii) The product LM is unambiguous. 
Then LM has a density iff L has a density, and if this is the case, 


6(LM) = 6(L)x(M) . (10.1.5) 
Proof. Since the product LM is unambiguous, we have 
fim = fı fm - 
In view of the preceding proposition 
6(LM) = ô( frm) = Ò(fL)0, 


where o = ) „>o T(M N A”) = 7(M). m 


This proposition will be useful in the sequel. As a first illustration of its use, note 


COROLLARY 1.4 Each right (left) ideal I of A* has a nonnull density. Each prefix- 
closed set has a density. More precisely 


6(l) =7(X), where X=I-IAt. 


2 
Proof. Let I be a right ideal and let X = I — IA*. By Proposition EF the set X is 
prefix and 
I=XA*. 
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The product X A* is unambiguous because X is prefix. Further 7(X) < 1 since X is 
a code, and 7(X) > 0 since J 4 Ú and consequently also X 4 Ø. Thus, applying the 
(symmetrical version of the) preceding proposition, we obtain 


5(I) = 5(XA*) = 1(X)6(A*) = 0(X) £0. 


Finally, the complement of a prefix-closed set is a right ideal. n 


Let X be a code over A. Then 7(X) < 1 and 7(X) = 1 if X is thin and complete. 
For a code X such that 7(X) = 1 we define the average length of X (relatively to 7) 
as the finite or infinite number 


X) = J baas niana. (10.1.6) 


zEX n>0 


The following fundamental theorem gives a link between the density and the average 
length. 


THEOREM 1.5 Let X C At be a code and let n be a positive Bernoulli distribution. If 
(i) m(X) =1, 
(ii) A(X) < œ, 
then X* has a density and 6(X*) = 1/A(X). 


Proof. Set fa = (XM A”). Then fx DOzo fnt”. Since X is a code, we have as a 
consequence of Proposition CC1 


fx =- fx)’. (10.1.7) 


Let g = J po gnt” be defined by 


g=(1-— fx = T (10.1.8) 


Identifying terms, we have for n > 0, 


N 
m=i- f 
i=0 


From 7(X) = X £o fi = 1 we get that 


[0.6] 
w= y h (10.1.9) 


i=n+1 


Note that by EEY, gn > 0 for n > 0. Moreover, by MTO and E, 
dime = 3 Y f= lif = XX). (10.1.10) 


n=0i=n+1 1=0 


We see that We ana retry give 
t* = fxg. (10.1.11) 
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Since A(X) is Be d not zero (indeed 7(X) = 1 shows that X # Ø), we can apply 
Proposition . Since 6(t*) = 1, ô( fx») exists and 


ôl fx») (Som) = (fx) A(X). . 


1.5 
Note the following important special case of Theorem i | 


THEOREM 1.6 Let X be a thin complete code over A, and let m be a positive Bernoulli 
distribution. Then X* has a density. Further 6(X*) > 0, A(X) < œ , and 6(X*) = 


1/A(X). 
Proof. Since X is thin and complete code, 7(X) = 1. Next, since X is thin, px > 1 
by Proposition Farts the derivative of fx which is the series 


fx = So nar(X NAPETE, 


n>1 


also has a radius of convergence strictly greater than 1. Hence f%(1) is finite. Now 


fx(1) = So na(XN A") = A(X). 


n>1 


1.5 
Thus A(X) < œ and the hypotheses of Theorem ES are satisfied. m 


2 Probabilities over a monoid 


[sectiond 3] 


A detailed study of the density of a code, in relation to some of the fundamental 
parameters, will be presented in the next section. The aim of the present section is to 
prepare this investigation by the proof of some rather delicate results. We will show 
how certain monoids can be equipped with idempotent measures. This in turn allows 
us to determine the sets having a density, and to compute it. 
; : o = 12 
We need the following lemma which constitutes a generalization of Proposition i E) 


LEMMA 2.1 Let I be a set, and for eachi € I, let 


O= POM, GQ a= P eri 
n=0 n=0 


be formal power series satisfying 
(i) o = Zier Enzo gn’ < 00; 
(ii) 0< fË <1 forallicI, n> o0, 
(iii) 6(f) exists for alli € I. 
Seto = poe: gi? forie I. Then Vie; fg admits a density and 


5 (= 740) = So 5(f 


iel iel 
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We first prove the following “dominated convergence” lemma. It gives a sufficient 
condition to allow one to extend the formula 


ôf +g) = 8f) + 4(g) 


to an infinite sum. 


inatedconvergence| LEMMA 2.2 Let I be a set and for each i € I, let 
. oo . 
uO = Sue", E Rll 


n=0 
be a formal oe series satisfying 


(i) Liez te < oo for alln > 0. 
(ii) 6(u) exists for alli € T. 
(iii) There is a sequence (v) je; of elements of R} such that =. v® < œ and 
u < yl) foralli € I andn > 0. 


Then 
AD u®) = S76 

ie! iel 
Proof Let wn = vier u® and w = endo Wnt” in such a way that w = icy u, We 
show that 

-X 8(u®)| <e 
iEI 

for arbitrary e > 0. Since the series J };ez v is convergent, there is a finite set F C I 


such that J jer- p v® < e. Then Wn — Jier u® < e and thus 6(w) — Vier d(u) < e. 
Since F is finite, 6(0,¢pU) = Jep 6(u) and the result follows. m 


Proof. Let f and g be two series satisfying the hypotheses of Lemma B.T EI 
fOg® and v® = abt gË. We verify that the conditions of Lemma g 2.2} are satishe 


Since rw < 1 for alli € J and n > 0, we have ul) Ss og” - Thus > sez ud < 
D nel Dest g® < oo. This shows that condition (i) is satighed. a by Proposi- 


tion I-J tu) exists for all i € J. Finally, u® < v® and ier v® = ø < œ showin 
EVA rgence 

that condition (iii) is also satisfied. . We can therefore apply Le ma, oO obtain 

Od icr fM%g®) = Dheir 6(fg®). Finally, we apply Proposition © obtain the de- 

sired result. 


2.1 si.3 
Lemma Prtteads to the following proposition which extends Proposition i “3 


PROPOSITION 2.3 Let I be a set and for each i € I, let L; and M; be subsets of A*. 
Let r be a Bernoulli distribution on A* and suppose that 


(i) Vier TMi) < 0, 
(ii) The products L;M; are unambiguous and the sets L;M; are pairwise disjoint, 
(iii) Each L; has a density 5(L;). 
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Then Use LiMi has a density, and 


wel icl 


22.1 
Proof. Set in Lemma PF 


fO =n(L,0 A”), gË =r(M;N A”). 


n 


Then f = fr, gË? = fm,; furthermore 6(f) = 5(L;), o® = q(M;), and in particular 
o = $ 7(M;) < œ. According to the lemma, we have 


ô (= 740) =) ô(Li)r(M;) - 


icl icl 


Since condition (ii) of the statement implies that 


> FOG = > fri fmi = oF ÍLiM: = fier LiMi 
ie icI ie 
the proposition follows. . 

Let p be a morphism from A* onto a monoid M, and let m be a positive Bernoulli 
distribution on A*. Provided M possesses certain properties which will be described 
below, each subset of A* of the form y~! (P), where P C M, has a density. The study of 
this phenomenon will lead us to give an explicit expression of the value of the densities 
of the sets y7! (m) for m € M, as a function of parameters related to M. 

A monoid M is called well founded if it has a unique minimal ideal, if moreover this 
ideal is the union of the minimal left ideals of M, and also of the minimal right ideals, 
and if the intersection of a minimal right ideal and of a minimal left ideal is a finite 
group. 

Any unambigugu _Fgonoid of ‘ Eee finite minimal rank is well founded by 
Proposition and Theorem . It appears that the development given now 
does not depend on the fact that the elements of the monoid under concern are relations; 
therefore we present it in the more abstract frame of well-founded monoids. 

Let p : A* — M be a morphism onto an arbitrary monoid, and let m,n € M. We 
define 

Cmn = {w € A* | my(w) =n} =e (mn) . 


The set Cy is a right-unitary submonoid of A*: for u,uv E€ Crm, we have ny(u) = 
n=ny(uv) = ny(u)p(v) = ny(v). Thus Cr,n is free. Let Xp, be its base; it is a prefix 
code. Let 

Zm,n = Um n — Cmn A” 


be the initial part of Cm,n. Then 
Cm,n = Vienne. os 
and this product is unambiguous. Note also that 


Cin = y7! (n) 
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PROPOSITION 2.4 Let y : A* — M be a morphism onto a well-founded monoid M, 
and letm be a positive Bernoulli distribution on A*. Let K be the minimal ideal of M. 
1. For allm,n€ M, the set Cm nn = yp 1(m—!n) has a density. 
2. We have 


T(LZmn)O(X*) ifne K andm'n#Z9, 


0 otherwise. 
3. For m,n € K such that nM = mM, T(Zm n) = 1 and consequently 
(Cmn) = (Cnn) = (Xn) - 


Proof. Let n € M, withn ¢ K. Thenm~!nnK = 0. Indeed, assume that p € m~!nnk. 
Then mp = n and since K is an ideal, p € K implies n € K. Thus for an element 
n ¢ K, the set Ç, n goes not meet the ideal y~!(K). Consequently Cyn is thin, and 
by Proposition i i Pe =0. 

Consider now the case where n € K. Let R = nM be the minimal right ideal con- 
taining n. Consider the deterministic automaton over A, A = (R, n,n) with transition 
function defined by r -a = ry(a) for r € R,a € A. We have |A| = Xž. Since R is a 
minimal right ideal, the automaton is complete "ae every state is recurrent. 
In particular, X» is a complete code (Proposition 

Let us verify that the monoid y_4(A*) has finite minimal rank. For this, let u € A* be 
a word such that y(u) = n. Since A is deterministic, it suffices to compute rank 4(u). 
Now rank(y,4(u)) = ranka (u) = Card(R- u) = Card(Rn) = Card(nMn). 

By assumption, nMn = nM Mn is a finite group. Thus ba auy) is finite 
and the monoid y4(A*) has finite minimal ran y (orollary e code Xn 
is complete and thin and according to Theorem EFE has a posi ive eee pa 
Zm,n is a prefix set, we have 7(Zmn) < 1. In view of Proposition a set Cm, 

a density and 


Clearly 
Cmn =0 Sw n=0 = Zana 


moreover, 7 being positive, T(Zmmn) > 0 iff Zmn # 0; this shows that ô(Cm,n) Æ 0 if 
mtn #0. This proves (2) and (1). 
3. Let u € A* be a word such that ny(u) = m and ny(u’) Æ n for each proper 
nonempty prefix u’ of u. Then 
UZm,n C Xn 


Thus Zm,n is formed of suffixes of words in X,,, and in particular Zm,n is thin. To show 
that Zm,n is right complete, let w € A* and let n’ = my(w). Then n’ € R = nM, and 
since R is nee minimal, there exists n” € M such that n'n” =n. Let v € A* be such 
that y(v) = n”. Then my(wv) = n, and consequently wv € Cyn. This shows that 
Zm, n = {1} or Zmm is a thin right complete prefix code, thus a maximal code. Thus 
1T(Zmn) = 1. Consequently (Cm, n) = (XÀ). n 


THEOREM 2.5 Let y: A* — M be a morphism onto a well-founded monoid, and let 7 
be a positive Bernoulli distribution on A*. Let K be the minimal ideal of M. 
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1. For alln € M, the set p-+(n) has a density. 
2. We have 


S(p (n))=0 ifn € K; 
for all R-equivalent m,n E€ K, 


(p~ (n)) = 5p" (m n)a (nM)) . 
3. For allne K, 


lp t (nM) - 6(p"*(Mn)) © 


d(p-*(n)) = Card(nM N Mn) 


Note that according to Corollary [I.4 eine one-sided ideal has a positive density; 
thus 6(y~!(nM)) and 6(y~!(Mn)) exist. 
Proof. 

1. Since y-!(n) = C ny the claim results from Proposition pa 

2. By Proposition BEA.) # 0 iff n € K, since Cin is never empty. Let us prove 
the second formula. 

For this, let Y = y-!(K) — yp !(K)A?* be the initial part of the ideal p~!(K). We 
have y-!(K) = YA*. Since the set A* — y!(K) is thin, we have 6(y-!(K)) = 1. 
Consequently 7(Y) = 1. 

For each R-class R of K, consider Yg = Y N y7! (R). Then y!(R) = YRA*, and 
hence, 6(y~!(R)) = (Yr). 

Let now R=nM. Then 


go *(n) = Jeng r))Crn (10.2.1) 
reR 


Indeed, each word w € y~!(n) factorizes uniquely into w = uv, where u is the shortest 
prefix of w such that y(u) € R. Then u € Yr Pes) for some r € R, and v € Crn. 
The converse inclusion is clear. The union in B is disjoint, and the products are 
unambiguous because the sets Yg N y7! (r) are prefix. Each C, n has a density, and 
moreover 


S oarnp ir) = nR) <1. 
reR 


2.2 
Thus we can apply Proposition Bet hich yields 


5(y"*(n)) = X a(Yr N p+ (r))SlCrn) - 
rER 
2.3 
According to Proposition Pi ai the (Crn) for r € R are equal. Thus, for any m € R, 
S(p (n)) = 5(Cmn)t(Yr) = ôl (mn) a (YR) 
(pt (m~*n))d(p7"(R)) . 
3. Set R=nM, L = Mn, and H = ROL. Then we claim that 


L= U mnn K 
mEH 
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and furthermore that the union is disjoint. 

First consider an element k € m~n N K for some m € H. Then mk = n. Thus 
n € Mk, and since n is in the minimal ideal, Mn = Mk. Therefore, k € Mn = L. 
This proves the first inclusion. 

For the converse, let k € L = Mn. The right multiplication by k, 


mre mk 


is a bijection which exchanges the £-classes in K and preserves R-classes (Proposi- 
tion CCOB-2). In particular, this function maps the L-class L onto Lk = L, thus onto 
itself. It follows that there exists m € L such that mk = n. This element is R- 
equivalent with n. Consequently m € H and k € m~!n for some m € H. Since the 
function m +> mk is a bijection, the sets m~!n are pairwise disjoint. This proves the 
formula. 
For al m,n E K, 
öp! (m= nn K)) = öl (m~n)) 


since the set y™i(m ~in N (M — K)) is thin and therefore has density 0. The set H 
being finite, we have 


5(p(L)) = $ amn). 


Using the expression for 6(y~!(n)) proved above, we obtain 


P _ ôl (n)) _ Slp (n)) 
O) = 2 D = DAR) 
This proves the last claim of the theorem. n 


Let E be a denumerable set. A probability measure over E is a function 


H : P(E) > [0,1] 


that satisfies the following two conditions: for any subset F of E, 


MP) = Yo ule) (10.2.2) 


ecr 


u(E)=1 (10.2.3) 


The support of a probability measure u is the set 


S = {e € E | ule) # 0} . 


A probability measure over E is completely determined by a function 


and 


u: E = [0,1] 
such that 
5 usi: 


e€E 
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u is extended to P(E) by ( . 

Note once more that the density 6 defined by a Bernoulli distribution on A* is nota 
probability measure over A* since it does not satisfy les 

We consider, for probability measures over E, the following notion of convergence: if 
(Un)n>0 and u are probability measures over Æ, then we set 


Indeed, in this case, any He F, where F is a subset of E, is summable, and 


L= lim py 


n— o0 


iff for all elements e € E, we have 


u(e)= lim pin(e) . 


n00 


The following proposition is useful. 


PROPOSITION 2.6 Let (un)n>0 and u be probability measures over E, such that u = 
limp—soo HUn. Then for all subsets F of E, 


y(F) = lim pm(F) . 


n— oo 


Proof. The conclusion clearly holds when F is finite. In the general case, set 
o = liminf un(F), T = lim sup un (F), 
and let F = E — F. Of course, 0 < 7 and 
1—7 =liminf un (F) . 


Let F” be a finite subset of F. Then un(F') < un(F) for all n, and taking the inferior 
limit, w(F") < o. It follows that 


w(F)= sup u(F’)<o 
F'CF 
F' finite 


Similarly, u(F) < 1—7. since u(F) + u(F) = u( E) = 1, we obtain 1 < ø + (1 -— 7), 
whence o > 7. Thus ø = T. Since u(F) < o and (F) < 1— ø, one has both (F) < o 
and u(F) > ø, showing that u(F) =o. n 


THEOREM 2.7 Let y: A* — M be a morphism onto a well-founded monoid, and let m 
be a positive Bernoulli distribution on A*. For any subset F of M, the set g™t(F) C A* 


has a density and 
)) = J aly (m). 
mEF 


Proof. It is convenient to introduce the shorthand v(F) = 6(y~1(F)). Let K be the 
minimal ideal of M, let T be the set of its R- asses and A the set of its £-classes. We 
have verified, during the proof of Theorem “that 


V(K)=7(Y)=1, v(R) =7(¥r) 
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where Y (resp., Yp) is the initial part of p7! (K), (resp., of p~!(R), with R € T). Since 
K is the disjoint union of its R-classes, we have 


mY) = D0 (YR) , 
RET 


hence 


Rer LEA 


where the intermediate assertion follows by symmetry. 5A 
Now consider a fixed R-class R € r. Then by Theorem pt 


E v(R)y(Mn) v(L) 
Lv) = oann "2D ann 


LEA nERAL 


and also 
So vin) = Y, (= Hn) =} Wn) = 1. 
nek RET \neR 
Since v(n) = 0 for n ¢ K, it follows that 
2 6g Mn) = 1. 
neM 
Consider the family F of subsets of A*, 
F={p'(m)|meM}. 


The preceding formula gives 


S76) S11. 


TEF 


This shows that there exists a probability measure ô over F , defined for a subset K C F 
by 
Ks y a 


TEK 
For any positive integer n, define un : F — [0,1] by 


pin(T) = —(T AM) , 
n 


where A™ = {w € A* | |w| < n}. The morphism ¢ being surjective, the sets T € F 
form a partition of A*. Thus 
> Hn(T)=1, 


showing that un (n > 1) is a probability measure over F. Moreover, we have 


§(T) = 6() = lim pn(T) 
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r .2.5 
by the definition of a density. Thus 6 = limp_.o. Hn. Proposition Eratstiows that for any 
subset K of F, 
Ò(K) = lm pi, UG) 


n— o0 


Now K = {71 (m) | m € P} for some subset P of M. Thus 


but also 


= E mlm) = Y Salom) a). 
mEP 


Consequently un (K) = (1/n)r(y7!(P)n AM), and thus 


lim =(y-(P)A.A™) = S v(m) . 
mEP 


The left side of the formula is 6(y~!(P)) and the right side is 6(K). Thus 6(y~!(P)) 
exists and has the announced value. m 


PROPOSITION 2.8 Let p : A* — M be a morphism onto a well-founded monoid, and 
let x be a positive Bernoulli distribution on A*. The function v : M — [0,1] defined by 
v(m) = 6(y*(m)) is a probability measure over M. Let K be the minimal ideal of M. 
Then the following formulas hold: 


v(m) # 0 iff mek 
v(m) = v(n~'m)v(mM) ifm, n E€ K and nRm 
v(M') v(M'n K) for M'CM 


For each H-class H C K, andhe H, 
v(H) 


-E 10.2.5 

AA Card(H) ( ) 
Proof. All these formulas with th excepto £ (OZ are immediate consequences of 
the relations given in Theor ao HEE rs that the value of same 
for all h € H by formula - Next v(H) = $ peg v(h). This proves eee n 


EXAMPLE 2.1 Let y : A* — G be a morphism onto a finite group. Let m be a positive 
Bernoulli distribution. For g € G, 


1 


Slet (g) = —— 10.2.6 

P= IG) ( ) 
7 

in view of formula zy and observi at (H = K = G. This gives another method 

for computing the density in Example Ïo that example corresponds a morphism 


p: A* — Z/2Z onto the additive group Z/2Z with y(a) = 1 for any letter a in A. 
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EXAMPLE 2.2 Let y : A* — M be the morphism from A* onto the unambiguous 
monoid of relations M over Q = {1,2,3} defined by a = (a), 6 = y(b), with 
1 
0 
0 


01 0 0 0 
a=|1 0 0 B= 10-0 
110 11 


$3 |. : : 
s minimal ideal J is 


This monoid has already been considered in Example Di 
composed of elements of rank 1 and is represented in Fig. 


001 110 

0 

1 ap apa Ry 

1 

1 

i Pap Ba Re 

Li Lo 
Figure 10.1: The minimal ideal of the monoid M. 6_01 
Let m be the Bernoulli distribution defined by m(a) = p, 7(b) = q, with p+q = 1, 

p,q > O. Let mpute the probability measure v = dy~! over M. With the 
notations of Fig. , we have the equalities 


Lia = Lə Lı = Ly» 
Aa a027) 


9 
Set Xı = y`! (L1), X2 = y! (L2). By BZN, 
Me ing ty) = 0 Xb Ng tJ) 


X2 
Xa Ng (J) = XU X Xoo re tS) = X 02) 


Indeed consider, for instance, the last equation: if w € X,, then y(w) € Ly, hence 
y(wb) € Lə by the fact that L18 = Lo. Thus wb € Xo, and w € Xob! Ny (J). 
Conversely, let w € Xeb-' Ny (J). Since w € yp! (J), w € X4 U Xo. But if w € Xo, 
then y(wb) € ah ans that wb € X1, whence w ¢ X2b7!. Thus w € X4. 


In view of : 


Xia! = Ti Xıb7! = XUT] 
Xa! Xı U Xə U To Xob7! = XUT 


where T1, T!, To, T} are disjoint from y~'(J). Multiplication by a and b on the right 
gives 

Xı = XobdU (Tia U T;b) 

Xo = XaU XoaU X1bU (Tha U T5b) 


Since T; is thin, 6(Z;a) = 6(T1)a(a) = 0, and similarly for the other T’s. Thus 
6(X1) = 4(X2)q, (X2) = ô(X1) + ô(X2)p 


which together with 
6(X1) + 6(X2) =1 
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gives 
q 1 
6(X,) = —, 6(X2) = —— 
(X1) itq (X2) EF 
Thus 1 
q 
L = — L = —— 
v(L1) Tq v(L2) ET 
An analogous computation yields 
p 1 
v( Ri) = —, v( R2) = — 
(R) = TS caer 


In particular, since Rə N Lz = {Ba}, we obtain 
1 


“8e) = Gd +a) 


3 Contexts 


Let X C At be a thin complete code. We have seen that the degree d(X) of X is 
the integer which is the minimal rank of the monoid of relations associated with any 
pee sae trim automaton recognizing X*. It is also the degree of the permutation 
group G(X), it_is lso 2, the minimum of the number of disjoint interpretations in 
X (see Sker on. is section, we shall see that d(X) is related in a quite 
remarkable manner pr e density 6(X*). The set Gx of left completable words is 


Gx = {w € A* | A*w X* 40} = (A*I X* = AX 
symmetrically, the set Dx of right completable words is 
Dx = {w € A* | wA* N X* # 0} = X*(A*)! = X* A7 


THEOREM 3.1 Let X C A* be a thin complete code, and let m be a positive Bernoulli 
distribution on A*. Then 


1 
6(X*) = —— Dx). 10.3.1 3.1 
(x) = gC Ax) (10.3.1) 
Proof. Let A = (Q,1,1) be an unambiguous trim automaton re ng X“, let p be 
the associated morphism and M = y(A%*). In vie gf Corollary e monoid M 
is well founded. Set v = dy~!. By Proposition BS visa Atobability measure over M, 


and the values of v may be nee by the formulas of this proposition. 
Let K be the minimal ideal of M. Since v vanishes outside of K, we have 


(X*) = vy X*)NK) . 


Let Ê be the union of the R-classes in K meeting y(X*), and similarly let Ê be the 
union of those £-classes in K that meet y(X*). Then 


VX) NK) = vX) NRN) 
= Slvly(X*)N#) 
H 
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where the sum is over all H-classes H contained in RN Ê. For such an H-class H , we 


have 
WwX)NEA= E ums yy Ae 


Card(H) 
meyp(X*)NH meyp(X*)NH 


where R and L are the R-class and £-class containing H. Thus 


v(y(X") NH) = aw) | 


Now observe that for any H-class H c ROL, 


Card(y(X*)N H) 1 


Card(H) d(X) ` 


Thus the formula becomes 


5(X*) = ay Rm) = Fn M(Rr(L) 


Now 


yp '(R)=Dxng (K). (10.3.2) 


Indeed, let w € Dx Ny !(K). Then wu € X* for some word u. Consequently, 
p(wu) = (w)y(u) E p(X") NK, showing that the R-class of p(w), which is the same 
as the R-class of y(wu), meets y(X*). Thus p(w) € R. Conversely, let w € we UR), 
Then y(w) € R and there is some m € M such that y(w)m € y(X*)N K. Thus 
=l * : 
wy (m) N x* x we derive that w € Dx. 
It follows from that 


Since A* — y~!(K) is thin, we have 


6(Dx) =6(Dx Ny (K)) . 


Thus 6(Dx) = v(R) and similarly v(L) = 6(Gx). This concludes the proof. m 
-1.6 


The following corollary is a consequence of Theorem 


COROLLARY 3.2 Let X C A* be a thin complete code, and let m be a positive Bernoulli 
distribution on A*. Then 


1 


A(X) = qag iCD) . (10.3.3) 


We observe that for a thi aximal bifix code X C A*, we have Gy = Dx = 
A*. Thus in this, case, ( becomes A(X) = d(X). This gives another proof of 
Corollary [VIB ~ 
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Figure 10.2: An unambiguous trim automaton recognizing X*. 6_02 


EXAMPLE 3.1 Let A = {a,b} and coysider our old friend X = {aa, ba, baa, bb, bba} 
which is a finite complete code. In Fig. an automaton A = (Q,1,1) recognizing X* 
is represented. 


> a pala) = a, ya(b) = 8, where a, are the relations considered in Exam- 
ple 


| Lo obtain the set Dx, we can compute the deterministic automaton having as 
a set of states, the set of row vectors {0,1}°%. The transition function is determined 
by multiplication on the right, with the elements in y4(A*) considered as matrices. 
The part of this automaton that is accessible from the vector 100 is drawn in Fig. 
(Note that this automaton can also be considered as the deterministic a tomaton as- 
sociated with A by the subset construction, as given in Proposition pianetti sets of 
states represented by vectors.) The elements of Dx are the words which are labels of 
paths from 100 to a state distinct from 000. We obtain 


Dx = a* U(a?)*bA* . 
A similar computation on column vectors gives 
Gx =U UAT?) . 


Let m be the positive Bernoulli distribution given by 


Figure 10.3: A deterministic automaton for Dx. 6_03 


with p,q > O, p+q = 1. Then 
6(Dx) = ô(a*) + 8((a?)*bA*) = ô((a?)*bA*) 
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since 6(a*) = 0. Since (a?)*b is a prefix code, we have 
6(Dx) = 1((a7)*b) . 


Thus 


C 
ag Tr 


In a similar fashion, we obtain 


1 
On the ee hand, d(X) =1 since the monoid y,4(A*) has minimal rank 1. Thus by 


formula : 


MX) = (1+p)+4q) - 


This can also be verified by a direct computation of the average length of X. She 
computations made in this example are of course similar to those of Example BS 

Let X C A* be a code. A contezt of w € A* is a pair (u,v) of words such that the 
following two conditions hold: 


UWV = T12 `'-En (zi E€ X) 
and 
lul < ail, fol < [zn]. 


The set of contexts of a word w € A* (with respect to X) is denoted by C(w). The 
set C(1) is C(1) = {(u, v) € At x AT | uv € X}U {(1,1)}. The context of a word can 
be interpreted in terms of paths in the flower automaton A(X) = (P, (1,1), (1,1)): 
there exists a bijection between the set C(w) and the set P(w) of paths labeled w in 
the flower automaton. Indeed let 


c: (uu) = (v',v) 
be a path labeled w in A(X). Then uwv € X*. Thus either uwv = 1, or 
UWV = T172: Ly 


with zj € X and n > O. In that case, |u| < |x| and |v| < |x,|. Thus in both cases, 
(u,v) is a context. Consider another path 


E: (u, 0) = (0, v) 
Then both paths 
Gt) Gy) eee) a 0; 
(1,1) => (u, a’) > @,v) => (1,1) 
are labeled uwv. By unambiguity, c = ¢. Conversely, if (u,v) is a context of w and 
uwv = %1£2Q°++Xp, define two words u’, v’ by 
u =u tr; ifuF¥l, u'’=1 otherwise 


vou, ifvÆĖl, uv'=1 otherwise. 
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Then (u, u’) and (v’,v) are states in A% (X), and there is a path (u, u’) > (v’, v). The 
following result shows a strong relationship between all sets of contexts. 


THEOREM 3.3 Let X C A* be a thin complete code, and let m be a positive Bernoulli 
distribution on A*. For all w € A*, 


MX)= Š x(uo). 


(u,v)EC(w) 


Proof. Let Ap(X) = (P, (1,1), (1, 1)) be the flower automaton of X, let M = yp(A*) 
and set v = dy, . Let w € A*, m = yp(w), and let 


T(m) = {(r,2)€ MxM |rml € yp(x*)} 
tm) = So vrywe 
(r,£)ET (m) 


We compute t(m) in two ways. First define, for each state p € P, 
Rp={rEM|rip=1}, Lp={LEM |b =1}. 


Then rml € pp(X*) iff there exist p,q € P such that rip = 1, Mpq = 1, lg = 1. 
Consequently, 


Tinj= h) Ry Lg 


(p,q) 
Mp,q=1 


Thus 


(p,q) 
Mp,q=l 


Set p = (u,w’), q = (v’,v). Then Mp, = 1 iff there is a path c : p — q labeled w. 
According to the bijection defined above, this hold iff (u,v) € C(w). Next, 


pp (Rp) T X*u, pp (La) = vX*, 


hence 
v(Rp) = 6(X*u) = ô&(X*)n(u), v(La) = 6(vA*) = m(v)ô(X*). 
Consequently 
t(m)= X 5(X*)m(u)m(v)6(X*) = [0X]? X aw). 
(u,v)EC(w) (u,v)EC(w) 


This is the first expression for t(m). 
Now we compute t(m) in the monoid M. Let K be the minimal ideal of M. Since v 
vanishes for elements not in K, we have 


t(m= X vrh. 
(rLEKxXK 
rmley p(X*) 
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Let N = pp(X*) N K. Then 


tm) =S5 3 u(ryv() = So So arema n 


nEN (rL)eKXK nEN reK 
rml=n 


Let r € K. Since (rm)~!n Æ 0 iff rmRn, and since rRrm, we have (rm)~!n F 0 iff 


rRn and 
mE ve m((rmy tn) E MF 


nEN renM nEN renM 


2.6 
by Proposition Bra Farther 


Comparing both expressions for t(m), we get 
1 = 06(X*) `> n(uv). z 
(u,v)EC(w) 


There is an interesting interpretation of the preceding result. With the notations of 
the theorem, set for any word w € A*, 


Call y(w) the contextual probability of w. Then the theorem claims that if m is a 
Bernoulli distribution we have identically 


ied . E , -3.3 
The fact that y and 7 coincide is particular to Bernoulli distributions (see Exercise BS: 
We now study one-sided contexts. Let X C At be a code, and let w € A*. The set of 
right contexts of w is 


C,(w) = {v € A* | (1,v) € C(w)} . 


Thus v € C,.(w) iff wv = 2122-+- £n, (zi E€ X) with |v] < [zn]. 
Symmetrically, the set of left contexts of w is 


Ci(w) = {u € A* | (u,1) € C(w)} . 


We observe that 


C (w) X* =w! X* . (10.3.4) 


The product C,(w)X* is unambiguous, because X is a code. 


PROPOSITION 3.4 Let X C A* be a thin complete code and let A = (Q,1,1) be an 
unambiguous trim automaton recognizing X*. Let K be the minimal ideal of the monoid 
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M = p4(A*). Let be a positive Bernoulli distribution. 
For all w € yy (K) N Dy, we have 


m(C,.(w))6(Dx) = 1. (10.3.5) 
For all w € yz (K) N Gx, we have 
m(Co(w))6(Gx) = 1. (10.3.6) 


Proof. Set Y = ya, v = dy, and let R (resp., Ê) be the union of the R-classes Igsp., 
£-classes) in K that meet y(X™). We have seen, in the progf of Theorem | that 
6(Dx) = v(R) and 6(Gx) = v(L). According to formula BST 


5(w7!X*) = a(C,(w))S(X*) . 


Set n = (w) and T = {k € K | nk € y(X*)}. Then T C L since for k € T, 
we have nk € Mk N p(X*), showing that the left ideal Mk meets y(X*). Let H 
be an H-class contained in Ê. The function h > nh is a bijection from H onto the 
H-class nH. Since n € R, we have nH C Ê; since H € L we have nH C Ê. Thus 
nH c ROL. This implies that nH N y(X*) # Ø. Indeed let R and L denote the 
R-class and £L-class containing nH, and take m € RN y(X*), m’ € LN y(X*). Then 
mm’ E€ RO LN y(X*) =n y(X*). 

Setting d = d(X), it follows that 


Card(n N y(X*)) 1 


Card(nH) d’ 


Since HNT = {k € H | nk € (X*)} is in bijection with nH N y(X*), we have 


1 
Card(H N T) = Card(nH N y(X*)) = 7 Card(H) . 


Therefore 
v(T)= = So vu(ANT) - È cata a y Card(H OT) 
HcL (H) 
v 1 ss 
HCL 


We observe that me) = w!X*n gy }(K). Consequently 6(w-1X*) = v(T) = 
(1/d)v(L). Since also v(L) = 6(Gx), we obtain 


6(w 1 X*) 156(Gx)d(Dx) 
: = ——— (Dx) = — —————_ . 10.3. 
3 
By Theorem Bathe last expression is equal to 1. = 


PROPOSITION 3.5 Let X C AT be a thin complete code. Let n be a positive Bernoulli 
distribution on A*. For all w € A* the following conditions are equivalent. 
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(i) The set C,(w) is maximal among the sets C,(u), for u € A*. 
(ii) a(C,(w))6(Dx) = 1. 


Proof. With the notations of Proposition Bett “consider a word z € y 1(K)N X*. 
Then C,(w) C C,(aw), hence also 7(C,.(w)) < m(C,(xw)). On the other hand zw € 
y 1(K)M Dx. Indeed the right ideal generated by x is minimal, and there oye, there 
exists v € A* such that y(awv) = y(x). Thus rwv € X*. By Proposition ue we have 
1(C,(aw))d(Dx ) = 1 showing that 


m(C;(w)) < 1/6(Dx) . (10.3.8) 
Now assume C,(w) maximal. Then C,(w) = C,(rw), i ipg the equality sign in 
the formula. This proves (i) = (ii). Conversely formula ( shows the implication 


(ii) > (i). n 
In fact, the set of words w S A* such that, the set of right contexts is maximal is an 
: : apter : ction2bis. 
old friend; in Chapter bs. Section B we defined the sets of strongly left completable 
and simplifying words by 


E 
S 


{u € A*|Vv € A*, Jw € A* : www € X*}, 
{u € A*\Va € X*,Vv € A* : xuv E€ X* > we X*}. 


xten 


We ave Sgen, that, these sets are equal aa they are both nonempty (Exer- 


cise I : can be shown (Exercise iat, for a thin complete code X, the 
following three conditions are equivalent for all words w € A*: 

(i) w €E, 

(ii) w ES, 


(iii) C,(w) is maximal. 
This leads to a natural interpretation of qrmula GUE (see Exercise S We now 
establish, as a corollary of formula ( a property of ite, maxim l code yhich 
generalizes the property for prefix codes shown in Chapter ' i (Theorem ae 
THEOREM 3.6 Let X C At be a finite maximal code. For any letter a € A, the order 
of a is a multiple of d(X). 


Recall that the order of a is the integer n such that a” € X. 

Proof. Let be a positive Bernoulli distribution on A*. Let A = (Q,1,1) be a trim 
unambiguous automaton recognizing X*. Let K be the minimal ideal of the monoid 
M = vpa(A*). Let z € X* N yy (K). According to Proposition Ba 


m(C,(%))6(Dx) =1, m(Ce(x))d(Gx) =1. 
By formula (TEE the average length of X is 


d(X) 


ae TO OA 


Thus 
A(X) = d(X)r (Cr (x))r(Ce(x)) . 
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Let a be a fixed letter and let n be its order. Consider a sequence (Tk)kķ>0 of positive 
Bernoulli distributions such that 


lim mla) = 1 
k—oo 


and for any b € A—a, liMmk—oœ Tk(b) = 0. For any word w € A* we have limg_,.. Tk(w) = 
1 iff w € a*, and = 0 otherwise. For any k > 0, denote by A(X) the average length of 
X with respect to mk. Then 


A(X) = dX) re (Cr(x)) Ke (Ce(@)) . 


But also, by definition 


A(X) = SO lelle) . 


rEx 


Since X is finite, this sum is over a finite number of terms, and going to the limit. 
lim A,(X) = x| lim mlz). 
in, M00 = D le fin a 


Since liMmk—oo T(x) = 0 unless x € a*, we have 
lim A(X) =n, 
where n is the order of a. On the other hand, 


me(Cr(a)) = S mlo). 


veC; (x) 


The words in C,(x) are suffixes of words in X. Since X is finite, C, (x) is finite. Thus, 
going to the limit, we have 


jim, (Cp (e)) = 5. jim Te(v) = Card(C, (£) Na*) . 


vEC; (a2) Bi 
Similarly 
jim, Tr(Ce(x)) = tP im TK(v) = Card(Co(x) N a*) . 
veEC?e(x) 
Thus 
n = d(X) - Card(C;,(x) N a*) Card(Ce(x) N a*) 
This proves that d(X) divides n. n 


4 Exercises 


g section6.1 
Section 


1.1 Let X be a recognizable subset of A* and let 7 be a positive Bernoulli distribution. 
(a) Show that px = oo iff X is finite. 
(b) Show that if X is infinite, then px is a pole of the rational function fx. 
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g section6.2 
Section 


2.1 Show that any recognizable subset of A* has a density. 


2.2 Let M be a monoid, and let u, v be two probability measures over M. The 
convolution of u and v is defined as the probability measure given by 


px v(m) = S> pluvio). 


(a) Show that 
(lim tn) x V= lim (un*v). 


(b) Let r be a positive Bernoulli distribution on A*. For n > 0, let 7) the proba- 
bility measure defined by 


m")(L) = 2(LN A”) 
for L C A*. Show that 


gt) =g) krh. 


and let v = 6y~' be the probability measure over M defined in Proposition 2.8t 
Show that v is idempotent, i.e, 


(c) Let py: A* — M bea morphism onto a well-founded monoid. Let 7 be as a ny 


Vxý=VŅ. 


2.3 Let A = (Q,i, T) be a complete deterministic automaton and let m be a positive 
Bernoulli distribution on A*. A function 


7: Q —> [0,1] 
is a stationary distribution if 
À Zg ra) = 1. 
(ii) X pa- V(P)™(a) = 74) E Q). 
where the sum above runs over p € Q and a € A. 


(a) Let p be the representation associated with A and let M = y(A*). Show 
that if A has finite minimal rank, and if all states of A are recurrent, it admits 
a unique stationary distribution given for p € Q by 


y(p) = 1/A(Xp) , 


where Xp is the prefix code such that X$ = Stab(p). where Lpq is the set 
recognized by the automaton 
(b) Let d be the minimal rank of M, with the hypotheses of (a). Let € be the 
set of minimal images of A. Let B be the deterministic automaton with states 
E and with the action induced by A. Let y be the stationary distribution of A, 
and o that of B. Show that 


CORESI 


where the sum runs over the minimal images I € € such that q € I. 
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i section6.3 
Section 


3.1 Let X C At be a thin complete code. Let S(X) and E(X be the sets of sim- 
plifying and strongly left completable words defined in Chapter [lbis: ow that for 
w E€ A* the following conditions are equivalent: 
(i) w € S(X); 
(ii) w € E(X); 
(iii) C, (w) is maximal among all C, (u), u € A*. 


3.2 Let X be a thin maximal code. Let U be the prefix set such that E(X) = UA*, 
where E(X) = S(X) denotes the set of strongly right completable (and simplifying) 
words. (a) Show that for any word w;nA* the product C,(w)U is unambiguous and 
the result is prefix. 


(b) show that for any w € S(X), the set C,(w)U i maximal prefix. 


(c) use this to give another proof of formula 


3.3 Let a: A* — [0,1] be a function. Then 7 is called an invariant distribution if 
(i) m1) = 1; 
(ii) Yaca Twa) = Jaca (aw) = 1(w) for w € A*. 
Let X C At be a code and a: B* > A* a coding morphism for X, i.e., a(B) = X. 


Let 7 be an invariant distribution on B*. Show that the function 7° from A* into [0, 1] 


defined by 
n° (wo) = iw 5 n(a™t(uwv)) 


a) (u,v)EC(w) 


with A(a) = Xex |z|7(a~*(x)) is an invariant distribution on A*. Compare with the 
definition of the contextual probability. 


5 Notes 


Theorem E special case of a theorem of Feller which can be rephrased as follows 
in our context: “Let X C At be a code, and let m be a positive Bernoulli distribution 
such that 7(X) = 1. Let p be the g.c.d. of the lengths of the words in X. Then the 
sequence 

mX*= NA) (n>0) 


has a limit, which is 0 or ne according to A(X) = co or not” (see (L968), 


Theorem XIII.3.3). Theorem is less precise on two points: (i) we only consider 
the case where A(X) < oo and (ii) we only consider the limit in mean of the sequence 
m(X* Nn A”). 


The results of Section ids results, can be found in 
1965). Theorem is due to (3650). a TE 
from (03. 

Exercise 2.1} belongs to the folklore f,aytomata theory. The notion of stationary 
distribution introduced in Exercise PPE classical one in the theory of Markov pro- 


cesses. 
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Further developments of the results presented in this chapter may be found in 


chard and Perrin] (1980) and (L983). In particular these papers 


discuss the relationship of the concepts developed in this chapter with ergodic theory. 
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Chapter XI 


Factorizations of Free Monoids 


0 Introduction 


tion7bis. -4.1 
The main result of Section eorem characterizes factorizations of free monoids. 


It shows in particular that the codes which appear in these factorizations are circular. 
The proof is based on an enumeration technique. For this, we define the logarithm in 
a ring of formal power series in noncommutative variables. The properties necessary 
for the proof are derived. We illustrate the factorization theorem by considering a very 
general family of factoyizations obtained from sets called Lazard sets. 

Section 41s devoted to the study of factorizations into finitely many submonoids. 
We first con: jder,factorizations into two submonoids called bisections. The main result 
(Theorem Baie a method to construct all bisections. We then examine trisectio 
or factorizations into three submonoids. We prove a difficult result (Theorem 
showing that every trisection can be obtained by “pasting” together factorizations into 
four factors obtained by successive bisections. 


.5.4 


1 Factorizations 


section7bis.1 


Several times in the previous sections, we have used special cases of the notion of factor- 
ization which will be defined here. We shall see in this section that these factorizations 
are closely related to circular codes. Let I be a totally ordered set and let (X;)je7, be 
a family of subsets of A indexed by J. An ordered factorization of a word w € A* is 


a factorization 
W = T12 En (11.1.1) 


with n > 0, x; E X;, such that jı > jo > ++- jn- 

A family (X;)ier, is a factorization of the free monoid A* if each word w € A* has 
exactly one ordered factorization. 

If (X;)ier, is a factorization, then each X is a code, since otherwise the, ynique 
factorization would not hold for words in X;. We shall see later (Theorem j that 
each X; is in fact a circular code. 

Let us give a formulation in formal power series. Consider a family (o;)je7 of formal 
power series over an alphabet A with coefficients in a semiring K, indexed by a totally 


415 
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ordered set J. Assume furthermore that the family (0;)ie7, is locally finite. Let J = 
{j1,Jj2,--+;Jn} be a finite subset of J, with jı > jo > --- > jn . Set 


TJ = OjO ja’: Oj 


n 


Then for all w € A*, 


(Tj, w) = > (Oji, T1) (Oja £2) (Cjn 2n), (11.1.2) 


LIL En =W 


Let A be the set of all finite subsets of J. Then the family (77) z¢, is locally finite. 
Indeed, for each word w € A*, the set F'(w) of factors of w is finite. For each x € F(w), 
the set I, of indices i € I such that (o;i, x) Æ 0 is finite. From aea it follows that 
if (TJ, w) #0, then J C Uzer(w)Lz . Consequently there are only finitely many sets J 
such that (77,w) #0. These considerations allow us to define the product 


o =|[a+ai) 


ier 


by the formula 


=) r 


JEA 


If I is finite, we obtain the usual notion of a product of a sequence of formal power 
series, and the latter expression is just the expanded form obtained by distributivity. 

Consider a family (X;)jcr, of subsets of At indexed by a totally ordered set I. If the 
family is a factorization of A*, then 


A*=|[X? (11.1.3) 


ier 


Conversely, if the sets X; are codes and if the sgmigroups xe are pairwise disjoint, 
then the product [],-;Xj is defined and ives? implies that the family (X;)iez is a 
factorization of A*. 

19 
EXAMPLE 1.1 Formula ( St tes, that the family (Xn41,2n,---,21) is a factoriza- 
tion of A* for all n > 1. Lemma says that the family of sets (Y,...,2%n,2n-1,---,21) 
is a factorization of A*. 


The main result of this section is the following theorem. 


THEOREM 1.1 Let (Xi)icr be a family of subsets of At indexed by a totally ordered set 
I. Two of the three following conditions imply the third. 
(i) Each word w € A* has at least one ordered factorization. 
(ii) Each word w € A* has at most one ordered factorization. 
(iii) Each of the X; (i € I) is a circular code and each conjugacy class of 
nonempty words meets exactly one among the submonoids X¥. 
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The proof is based on an enumeration technique. Before we give the proof, we need 
some results concerning the logarithm of a formal power series in commuting or non- 
commuting variables. We shall consider a slightly more general situation, namely, the 
formal power series defined over monoids which are direct products of a finite number 
of free monoids. Let M be a monoid which is a direct product of finitely many free 
monoids. The set 


S=Q”v 


of functions from M into the field Q of rational numbers is equipped with the structure 
of an algebra as it was done for formal series over a free monoid. In particular if 
o,7 E€ S, the product ar given by 


(oT,m) = y (a, u)(T, v) 


is well defined since the set of pairs (u,v) with uv = m is finite. As in the case of formal 
power series over a free monoid, a family (0;);e7 of elements of S is locally finite if for 
all m € M, the set {i € I | (oi; m) # 0} is finite. Define 


SM = {0 € S | (0,1) = 0}. 


For ø € S“), the family (on)n>0 of powers of ø is locally finite. Indeed, for each m € M, 
(o",m) = 0 for all n greater than the sum of the lengths of the components of m. This 
allows us to define for all o € S$“, 


log(1 +0) = o — 0? /2 + 08/3 = BSI) tto” /n +- (11.1.4) 
o? o” 
exp(o)=1 +0 +5 ttar te (11.1.5) 


Let M and N be monoids which are finite direct products of free monoids. Let S = Q™ 
and T = QN. A morphism 
a:M—>T 


from the monoid M into the multiplicative monoid T is called continuous iff the family 
(a(m))mem is locally finite. In this case, the morphism a can be extended into a 
morphism, still denoted by a, from the algebra S into the algebra T by the formula 


ala) = `> (o, m)a(m). (11.1.6) 
mEM 


This sum is well defined since the family (a(m))mem is locally finite. The extended 
morphism a is also called a continuous morphism from S into T. For any locally finite 
family (o;)ier of elements of S, the family a(o;)ie7 also is locally finite and 


X a(oi) =q (= ni) (11.1.7) 


ie ie] 
7 
According to formula ffhtqa continuous morphism a: S$ — T is entirely determined 


by its definition on M, thus on a set X of generators for M. Furthermore, a is con- 
tinuous iff a(X — {1}) c T® and the family (a(x))z¢x is locally finite. This is due to 
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the fact that each m € M has only finitely many factorizations m = £1£2:-- £k with 
y Zk E X — 1. It follows from MEE that if o € S™®, then a(o) € T®. From 
, we obtain 


log(1 + a(ø)) 
exp(a(7)) 


a(log(1+0)), (11.1.8) 
a(exp(c)). (11.1.9) 


According to classical results from elementary analysis, we have the following formulas 
in the algebra Q[|[s]] of formal power series in one variable s: 


exp(log(1+s))=1+s, log(exp(s)) = s. (11.1.10) 


Furthermore, in the algebra Q[[s,t]] of formal power series in two commuting variables 
s,t, we have 


exp(s + t) = exp(s)exp(t), log((1+s)(1+t)) =log(1+s)+log(1+t). (11.1.11) 
Let M be a monoid which is a finite direct product of free monoids and let s = Q™, Let 
ao € S and let a be the continyoys morphism, from the algebra Ql[s]] into S defined 
by a(s) =o. Then by formulas ae | , we have 

exp(log(1+o))=1+0, log(exp(c)) =a (11.1.12) 


showing that exp and log are inverse bijections of each other from the set S' onto the 
set 


1489 ={14r[res}. 


Now consider two series 0,7 € S (1) which commute, i.e., such that or = To. Since 
the submonoid of S generated by o and 7 is commutative, the function a from s* x t* 


p à goea by a(s?t?) = o? 7? is a continuous morphism from Q|[s,t]] into S and by 


7 


exp(7) exp(7), 
log(1 +0) + log(1 +7). (11.1.13) 


explo + T) 
log((1+¢)(1+7)) 


These formulas do not hold when o an o not commute. We shall give a property 
of the difference of the two sides of ( in the general case. A series o € Q = 
Q((A)) is called cyclically null if for each conjugacy class C C A* one has 


(@c) =) (ew) =0 


wEC 


Clearly any sum of cyclically null series still is cyclically null. 


PROPOSITION 1.2 Let A be an alphabet and let S = Q((A)). Leta: S > S bea 
continuous morphism. For each cyclically null series o € S, the series a(o) is cyclically 
null. 
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eq7.4.9 


o 
Re) 
N 
nN 
= 
O 


() 
re} 
N 
De 
RB 
B 


() 
re} 
N 
N 
m. 
N 
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Proof. Let T C A* be a set of representatives of the conjugacy classes of A*. Denote 
by C(t) the conjugacy class of t € T. Let 


r=¢_| SO ww -2) 


teT \wecC(t) 


The family of polynomials ($ Zuec) (7, w)(w — t))ter is locally finite. Thus the sum is 


well defined. Next 
T = D >DE o, w)w -5 we o,w)t 


teT weC(t) teT weC(t) 
a OES X (oC (t))t 
teT 


Since ø is cyclically null, the second series vanishes and consequently 7 = ø. It follows 
that 


=E | E owalu) - a(t) 
teT \weC(t) 
In order to prove the claim, it suffices to show that each series a(w)— a(t) for w € C(t) 
is cyclically null. For this, consider w € C(t). Then t = uv, w = vu for some u,t € A®*. 
Setting u = a(u),v = a(v), one has 
a(w) — a(t) = vu — pv 
Next, 


yu= > (v,x)(u,y)zy 


xz,yeEA* 
Thus, 
vu = pv = 5 (v, x) (MH, yxy 


x,yeEAt 
Since each polynomial xy — yx clearly is cyclically null, the series vu — uv and hence 
a(o) is cyclically null. : 
PROPOSITION 1.3 Let A = {a,b}, and let C be a conjugacy class of A*. Then 
(log((1 + a)(1 +b)), C) = (log(1 +a), C) + (log(1 + b),C) (11.1.14) 
In other words, the series log((1 +a)(1 + b)) — log(1 + a) — log(1 +b) is cyclically null. 


Proof. One has (1 + a)(1+ 6) =1+a+b6-+ ab and 


log((1+a)(1+5)) = > eure 


m>1 


(a + b+ ab)™ 


Let w € A”, and let d be the number of times ab occurs as a factor in w. Let us 
verify that 


(a+ b+ ab", w) = ( i ). (11.1.15) 
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Indeed, ((a + b+ ab)”, w) is the number of factorizations 
W = %1%2°** Lm 


of w in m words, with x; € {a,b,ab}. Since w has length n and the z;’s have length 1 
or 2, there are exactly n — m x;’s which are equal to ab. Each factorization of w thus 
corresponds to a choice of n — m factors of w equal to ab among t 3 occurrences of 
ab. Thus there are exactly (2 d m) factorizations. This proves ( 
Now let C be a conjugacy class, let n be the length of the words in C and let p 
their exponent. Then Card(C F = n/p. If C C a*, then C = {a"}. Then formula 
Pe ns hat ((q,+ b + ab)™,a”) equals 1 or 0 according to n = m or not. Thus 
both sides of eee in this case are equal to (—1)"/n. The same holds if C C b*. 
hys, we may assume that C is not contained in a* U b*. Then the right-hand side of 
equals 0. Consider the left-hand side. Since each word in C contains at least 
one a, there is a word w in C whose first letter is a. Let d be the number of occurrences 
of ab as a factor in w. Among the n/p conjugates of w, there are d/p which start with 
the letter b and end with the letter a. Indeed, set w = v?. Then the word v has d/p 
occurrences of the factor ab. Each of the d/p conjugates of w in bA*a is obtained by 
“cutting” v in the middle of one occurrence of ab. Each of these d/p conjugates has 
only d— 1 occurrences of ab as a factor. The (n — í 4 e conjugates of w have all d 
occurrences of the factor ab. According to formula , we have for each conjugate 


u of w, 
(Sn if u € bA*a, 
n—m 
((a+b+ab)™,u) = 
d : 
( ) otherwise. 
n—m 
Summation over the elements of C gives 
dfd-1 —d d 
(at b+abyr.c)=S (871) E 
p\n-—m p \n-m 
Since (pm) = (d—n + m/d)(,,%,,), we obtain ((a + b + ab)™,C) = (m/p) (n m): 
Consequently 


(log(1 + a)(1+6),C) = - So (-1)""" a (11.1.16) 


m>1 


since n > d and d £0, this alternating sum of binomial coefficients equals 0. n 
-4.3 


The following proposition is an extension of Proposition 


PROPOSITION 1.4 Let (o;)ier be a locally finite family of elements of Q((A)) indexed 
by a totally ordered set I, such that (o;,1) =0 for alli € I. The series 


log [ [+ 04) — So log(1 + 4) (11.1.17) 


iEl icl 


is cyclically null. 
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Proof. Set S = Q = Q((A)), and SY = {0 € S | (o,1) = 0}. Let o,r € SM. The 
series 
6 = log((1 +0) (1 +7)) —log(1 +0) — log(1 + T) 


. 3 
is cyclically null. Indeed, either o and 7 commute and 6 is null by RET, or the 
alphabet A has at least two letters a,b. Consider a continuous morphism a such that 
a(a) = 0, a(b) =r. The series 


d= (log((1 + a)(1 + b)) — log(1 + a) — log(1 + b) 


4.3 4.2 
is cyclically null by Proposition Ea sice ô = a(d), Proposition EF stiows that 6 is 
cyclically null. Now let T1, T2,...,Tn E€ 1+ S®. Arguing by induction, assume that 


c= log(Tn 7s log Ti 


is cyclically null. In view of the preceding discussion, the series 
= log(Tn a T2T1) E log (Tn eig T2) — log 7 


is cyclically null. Consequently 
e+e = log(Tn D log 7; 


is cyclically null. This proves EEY for finite sets J. For the general case, we consider 
a fixed conjugacy class C. Let n be the length of words in C and let B = alph(C). 
Then B is finite and C C B”. Define an equivalence relation on S' by 


o~r iff (o,w)=(r,w) foral we BM”, 


(Recall that BI] = {w € B* | |w| < n}.) Observe first that o ~ T implies o* ~ 7* for 
all k > 1. Consequently o ~ 7 and o,r € S® imply log(1 + o) ~ log(1 + 7). 
Consider the family (7;)ier, of the statement. Let 


Ip = {i € I | oj ~ 0}, I'=I-Ip 


Then I’ is finite. Indeed, for each w € Bl”! there are only finitely many indices i such 
that (o;,w) #0. Since B is finite, the set B” is finite and therefore I’ is finite. 
Next observe that 


[[G+e) ~ [[a+o), (11.1.18) 


tel ier’ 


since in view of HI we have (TJ, w) = 0 for w € B”) except when J C I’. It 


follows from ( 


log (Te + “| ~ log (The + a) 


iEI iel’ 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


422 XI. Factorizations of Free Monoids 


(i (Te +a) c) = (i (Ile +a) a) . 


Next, since g; ~ 0 for i € Ip, we have log(1 + 0;) ~ 0 for i € Ip. Thus 


(Saso) o) - (Sa + ci), o). 


icl icl 


Consequently 


From the finite case, one obtains 


(i NG + n) a) = (Sa + n)a) . 
ier! ier’ 


Putting all this together, we obtain 


(we (Ia +20) ¢) = (Erea +o; o) - (Sa San; o). 


Thus the proof is comnie, 


orem we need a final lemma which is a reformulation of EEE E 
Ra and i eu 


PROPOSITION 1.5 Let X C At be a code. 


1. For each conjugacy class C meeting X*, we have 
(log X*,C) > (log A*,C), 


and the equality holds if X is a circular code. 
2. Conversely if (log X *,C) = (log A*,C) for all conjugacy classes that meet 
X*, then X is a circular code. 


Proof. We have X* = (1 — X.)~!. Thus log(X*(1 — X)) = 0. since the series X* and 
1—X commute, we have 0 = log X *+log(1—X_), showing that log X * = — log(1 -X ). 
Thus 


log X* = SX" 


m>1 


In particular, if C C A™ is a conjugacy class, then 


(log X*,C.) = > = Card(x" ©), 


m>1 


For X = A, the formula becomes 
1 
(log A*, C) = = Card(C). 
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and i = 


2134 
and ae computations, the proposition is a direct consequence of Propositions i F 


.4.1 
Proof of Theorem ssume first that conditions (i) and (ii) are satisfied, i.e., that 
t mily (X;)ier, is a factorization of A*. Then the sets X; are codes and by formula 


we have 
A*=]|[X?. (11.1.19) 


tel 


Taking the logarithm on both sides, we obtain 


log A* = log (11x) . (11.1.20) 
icI 
4.4 
By Proposition feat tire series 


6 = log A* -X log X} (11.1.21) 


iel 


is cyclically null. Thus for each conjugacy class C 


(log A*,C) = 5 “(log X7,C). (11.1.22) 


iel 


S a -4.5 . 
In view of Proposition E we have for each i € J and for each C that meets X¥ the 


inequality 
(log A*,C) < (log X}, C) (11.1.23) 
Formulas HT and HTA show that for each conjugacy class C, there exists a 


unique j € J such that C meets X¥. For this index j, we have 


(log A*,C) = (log X}, C). (11.1.24) 


Thus if some X, gts a conjugacy class, no other X* (i € I — j) meets this conjuga is 
class. Since ier holds, each of the codes X; is a circular code by Proposition [Eg 
This proves condition (iii). 

Now assume that condition (iii) holds. Let C be a conjugacy c ss and let i € I be the 
unique index such that Xý meets C. Since X; is circular, é se holds by Propgsitign 

and furthermore (lo Xz, C) = 0 for all j Æ i. Summing up all equ lities ( ; 
we obtain Eq. ( . This proves that the series 6 defined by (aot is cyclically 
null. 

Let a be the canonical morphism from Q((A)) onto the algebra Q|[A]] of formal 
power series in commutative variables in A. The set of words in A* having the same 
image by a is union of conjugacy classes, since a(uv) = a(vu). Since the series ô is 
cyclically null, the series RE since & is a continuous morphism, we obtain by 
applying a to both sides of ( 


0 = loga(A*) — 5 log a(X7) 
icl 
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Hence 


log a(A*) = X log a(X7) (11.1.25) 
iol 


* 


p 4,4 
* exists. By Proposition frat the 
toe([].X2) -X log X} 


iEl iEl 


Next, condition (iii) ensures that the product ] [er X 
series 


is cyclically null. Thus its image by a is null, whence 


toga([] 27) = Drga. 
iel iel 
This together with ES shows that 
loga(A*) = toga([] x"). 
tel 


Since log is a bijection, this implies 


a 


[> 
Í 
Q 
= 
[>< 
a 


This means that a(€) = 0, where 


Observe that condition (i) means that all the coefficients of € are negative or zero. 
Condition (ii) says that all coefficients of € are positive or zero. Thus in both cases, 
all the coefficients of e have the same sign. This together with the condition a(€) = 0 
implies that e = 0. Thus if condition (iii) and either (i) or (ii) hold, then the other one 
of conditions (i) and (ii) also holds. 


| 
A factorization (X;)jer, is called complete if e hX i is reduced to a singleton x;. The 
following result is a consequence of Theorem f 7 : 


COROLLARY 1.6 Let (x;)ier be a complete factorization of A*. Then the set X = {z; | 
i € I} is a set of representatives of the primitive conjugacy classes. In particular, for 


alln > 1, 
Card(X N A”) = 4n (k) (11.1.26) 
with k = Card(A). 


-4.1 
Proof. According to condition (iii) of Theorem ; each conjugacy class intersects 


exactly one of the submonoids X¥. In view of the same condition, each code {x;} is 
circular and consequently each word x; is primitive. This shows t X is a system of 
representatives of the primitive conjugacy classes. Formula ( is an immediate 
consequence. n 
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Now we describe a systematic procedure to obtain a large class of co lete factor- 
izations of free monoids. These include the construction used in Section p 


A Lazard set is a totally ordered subset Z of At satisfying the following property: 
For each n > 1 the set Z N Al = {21, 22,---, Zk} with 21 < 22 < --- < z satisfies 


zi € Zi for 1<i<k, and Zi n A = 0, 
where the sets Z1,... 2,41 are defined by 
WHA, 24] 2)(4,-4) Ost < k). 
(Recall that Al”! = {w € A* | |w| < n}.) 


EXAMPLE 1.2 Let (£n)n>1 be a Hall sequence over A such that £n is a word of minimal 
length in X,, and let Z = {æn | n > 1} be the subset of A* ordered by the indices. 
Then Z is a Lazard set. 


-3.8 
EXAMPLE 1.3 Let (%n)n>1 be the sequence used in the proof of Theorem ecall 
that we start with X, = A and 


PETE = x; (X; — zi) 1 > 1, 


where z; is a word in X; of minimal odd length. Denote by Y the set of even words in 
Ui>1 Xi Now set Yı = Y and for i > 1, 


Yii = y; (Yi — yi), 
where y; € Y; is chosen with minimal length. Let T = {2;, yi | i > 1} ordered by 
Ti < ta < EKL Bn KA L U K G K 
The ordered set T is a Lazard set. Indeed, let n > 1 and 
TOAM = {£1, £2, ..., Er, Y1, Udy + - -3 Ys} 
Set 


Z, = Ao lais r+l, 
Zrii = YilZryi— yi 1<i<s. 


We show by induction on 7 that 


Zin A =y nA (<i<s+1). (11.1.27) 


Indeed, words in X;41 = Zr+1 of length at most n all have even length (since the words 
with odd length are x1, %2,...,£r). Thus all these words are in Y = Yı. Conversely, 
any word of even length < n is already in X,+1, since |x,41| > n. 

Next, consider y € Y;+ı N A]. Then y = y?y’ for some y' € Y; — yi- since |y'| < n, 
we have by the induction hypothesis y’ € Z,4;, whence y € Zr+i+1. The converse is 


proved in the same,way. 
Equation Ham shows that y; E€ Z,4; for 1 < i < s and that Zr+4s+1 N All = 9. 
Thus T is a Lazard set. 
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PROPOSITION 1.7 Let Z C At be a Lazard set. Then the family (z)zez is a complete 
factorization of A*. 


Proof. Let w € A* and n = |w]. Set ZnAM = {215 20;+++,2,} with 2) < Z2 < +++ < Zp. 
Let Zı = A and Zj41 = Z7(Z; — zi) fori = 1,2,,, he q hen zi E€ Zi fori=1,2,...,k 
, | 7 pi 7 
and Zę}ı N Aln] = Ø. As in the proof of Lemma EaI e have for 1<i< k, 
Zi = Zivizir 
whence by successive substitutions 


A* = Z kyk o i. (11.1.28) 


Thus there is a factorization w = YZi; Zio *** Zi, With y E€ Zkņ}ı and i > i>- > 
in. since Zķ41 N Al] = Ø, we have y = 1. This proves the existence of an ordered 


factorization. Assume there is another factorization, say w = tits ''tm. with tj € Z, 
tı > tg >- > tm. Then t; € ZN AM for each i. Thus by ( both factorizations 
coincide. n 


We conclude this section with an additional example of a complete factorization. 
Consider a totally ordered alphabet A. Define a relation called alphahetic ordering on 
A* by setting u < v if u is a proper prefix of v, or if u = ras, v = rbt, a < b for a,b E€ A 
and r,s,t € A*. The alphabetic ordering has the property 


U<V S WU ~ wV. 


By definition, a Lyndon word is a primitive word which is minimal in its conjugacy 
class. In an equivalent way, a word w € AF is a Lyndon word iff w = uv with u,v € AT 
implies w < vu. Let L denote the set of Lyndon words. We shall show that (l)jer is 
a complete factorization of A*. For this we establish lemmas which are interesting on 
their own. 


PROPOSITION 1.8 A word is a Lyndon word iff it is smaller than all its proper nonempty 
right factors. 


Proof. The condition is sufficient. Let w = uv, with u,v € At. since w < v and 
v < vu, we have w < vu. Consequently w € L. Conversely, let w € L and consider 
a factorization w = uv with u,v € At. First, let us show that v is not a prefix of w. 
Assume the contrary. Then w = vt for some t € At. since w € L, we have w < tv. 
But w = uv implies uv ~ tv. This in turn implies u < t whence, multiplying on the 
left by v, 


vu < vt = w, 


a contradiction. Suppose that v < uv. Since v is not a prefix of w, this implies that 
vu < uv and w ¢ L, a contradiction. Thus wv < v, and the proof is completed. n 


PROPOSITION 1.9 Letl,m be two Lyndon words. Ifl < m, then lm is Lyndon word. 
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Proof. First we show that Im < m. If | is a prefix of m, let m = lm’. Then m < m' 
by Proposition —Thus lm < Im! = m. If lis not a prefix of m, then the inequality 


l< m directly implies Im < m. t 4% be a nonempty proper suffix of Im. Ifv is a 
suffix of m, then by Proposition m< v. Hence Im < m < v. Otherwise v = v'm 
for some proper nonempty suffix v f l Phen l< uv’ and consequently Im < v'm. Thus 
in all cases lm < v. By SAAN Paes shows that Im € L. n 


THEOREM 1.10 The family (Dier is a complete factorization of A*. 


Proof. We prove that conditions (i) and (iii) of Theorem aie satisfied. This is clear 
for condition (iii) since L is a system of representatives of primitive conjugacy classes. 
For condition (i), let w € At. Then w has at least one factorization w = ¢;09--+n 
with 4; € L. Indeed each letter is already at Lyndon word. Consider a factorization 


w = hibo- hy 


into Lyndon words with minimal n. Then this is an ordered factorization. Indeed, 
otherwise, these would be some index i such that 4; < Zi+1 But then l = 4;li+1 € L and 
w would have a factorization into n — 1 Lyndon words. Thus condition (i) is satisfied. 

| 


Z .4.4 
It can be seen (Exercises Seay that the set L is a Lazard set. 


2 Finite factorizations 


In this section we consider factorizations (X;)je7 with J a finite set. These are families 
Xn,Xn—1,---,X1 of subsets of A* such that 


Aa RIK giv et (11.2.1) 


According to Theorem frit each Xi is a circular code and each conjugacy class meets 
exactly one of the X;. The aim of this section is to refine these properties. We shall 
see that in some special cases the codes X; are limited. The question whether all codes 
appearing in finite factorizations are limited is still open. We start with the study of 
bisections, that is, factorizations of the form (X,Y). Here X is called left factor and Y 
is called right factor of the bisection. Then 
A*=x*y* 


(11.2.2) 


EXAMPLE 2.1 Let A = {a,b}. The pair (a*b, a) is a bisection of A*. More generally, if 
A= Ap U A; is a partition of A, the pair (Aj.A1, Ao) is a bisection of A*. 


2 
Formula (tH can be written as 


YX+A=X4Y. (11.2.3) 
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leq7 5—2 
Indeed, (TT Z2) is equivalent 7 4 =% )(l — X) by taking the inverses. This 
gives (1.2.3). Equations ( and ( show that a pair (X,Y) of subsets of 


A* is a bisection iff the following are satisfied: 


Ac XUY, (11.2.4) 
XAY =), (11.2.5) 
YXCXUY (11.2.6) 

each z € X UY, z ¢ A factorizes uniquely into z = yx with 
(11.2.7) 

rEX,yey. 
5.4 

We shall see later (Theorem at a subset of these conditions is already enough 


to ensure that a pair (X,Y) is a bisection. 

Before doing that, we show that for a bisection (X,Y) the code X is (1,0)-limited 
and the code Y js, (9,,1)-limited. Recall that a (1,0)-limited code is prefix and that 
by Proposition Bes efi code X is (1,0)-limited iff the set R = A* — XA* isa 
submonoid. Symmetrically, a suffix code Y is (0, 1)-limited iff the set S = A* — A*Y is 
a submonoid. 


PROPOSITION 2.1 Let X,Y be two subsets of At. The following conditions are equiv- 
alent: 

(i) (X,Y) is a bisection of A*. 

(ii) X,Y are codes, X is (1,0)-limited and Y* = A* — X A*. 

(iii) X,Y are codes, Y is (0,1)-limited and X* = A* — A*Y. 


Proof. (i) = (ii) From A* = X *Y* we obtain by multiplication on the left by 1 — X 
the equation (1— X )A* = Y *, showing that Y * = A*— X A*. The number of prefixes 
in X of any word w € A* is (X A*,w). The equation shows that this number is 0 or 1, 
according to w € Y* or w ¢ Y*. This proves that X is a prefix code. This also gives 
the get, relation Y* = A* — X A*. Thus A* — X A* is a submonoid and by Proposition 
p the code X is (1, 0)-limited, 

(ii) > (i) By Theorem [LJ, we have A* = X *R with R = A* — X A*. Since R = Y* 
and Y is a code, we have R = Y *. Thus A* = X *Y *. 

Consequently (i) and (ii) are equivalent. The equivalence between (i) and (iii) is 
shown in the same manner. , 


COROLLARY 2.2 The left factors of bisections are precisely the (1,0)-limited codes. 


Observe that for a bisection (X,Y), either X is maximal prefix or Y is maximal 
suffix. Indeed, we have Y* = A* — X A*. If Y* contains no right ideal, then X A* is 
right dense and consequently X is maximal prefix. Otherwise, Y * is left dense and thus 
Y is maximal suffix. 
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PROPOSITION 2.3 Let M,N be two submonoids of A* such that 
A*=MN. 
Then M and N are free and the pair (X,Y) of their bases is a bisection of A*. 


Proof. Let u,v be in A* such that uv € M. Set v = mn with m € M, n € N. Similarly 
set um = m'n’ for some m’ € M, n’ € N (see Fig. ). Then wv = m'(n'n). Since 
uv E€ M, unique factorization implies n = n’ = 1. Thus v € M. This shows that M 
satisfies condition C(1,0). Thus M is generated by a (1,0)-limited code X. Similarly 
N is generated by a (0, 1)-limited code Y. Clearly (X,Y) is a factorization. n 


EXAMPLE 2.2 Let M and N be two submonoids of A* such that 


MAN={1}, MUN=A*. 


Figure 11.1: Factorizations. 
We shall associate a special bisection of A* with the pair (M, N). For this, let 
R={r € A |r=w=sve M} 
be the set of words in M having all its suffixes in M. Symmetrically, define 
S = {s € A* | s= uv = ue N}, 


The set R is a submonoid of A* because M is a submonoid. Moreover, R is suffix- 
closed. Consequently the base of R, say X, is a (1,0)-limited code. Similarly S is a 
free submonoid and its base, say i ; #,(0, 1)-limited code. We prove that (X,Y) is a 
bisection. In view of Proposition Z.I 1t suffices to show that Y* = A* — X A*. First, 
consider a word y € Y* = S. Then all its prefixes are in N. Thus no prefix of y is in 
X. This shows that Y* C A* — X A*. Conversely, let w € A* — X A*. We show that 
any prefix u of w is in N by induction on |u|. This holds clearly for |u| = 0. Next, if 
|u| > 1, then u cannot be in R = X* since otherwise w would have a prefix in X. Thus 
there exists a factorization u = u’v' with v’ ¢ M. Hence v’ € N and v' Æ 1. By the 
induction hypothesis, u’ € N. since N is a submonoid, u = uv’ € N. This proves that 
wes=YyY*. 

A special case of this construction is obtained by considering a morphism y : A* => Z 
into the additive monoid Z and by setting 


M {m € A* | p(m) > 0} U {1}, 
N = {ne A* | y(n) < 0}. 
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Given a word w € A*, we obtain a factorization w = rs with r € R,s € S as follows. 
The word r is the shortest prefix of w such that y(r) is maximal in the set of values of 
y on the prefixes of w (see Fig. ): 


5.2 
The construction of Example = can be considered as a special case of the very general 
following result. 


THEOREM 2.4 Let (P,Q) be a partition of At. There exists a unique bisection (X,Y) 
of A* such that X C P and Y C Q. This bisection is obtained as follows. 
Let 


X,=PNA, Y=QNA (11.2.8) 
and for n > 2, 
Zn = UY Ani (11.2.9) 
Xn = Zn NAP, Yn = 2ZnNQ (11.2.10) 
Then 
X= Un>1Xn and Y = Uns1Yn: (11.2.11) 
oA r sie s >l 


a b a a b a b b 
Figure 11.2: (a) = 1, y(b) = —1, w = abaabab 


Proof. We first prove uniqueness. Consider a bisection (X,Y) of A* such thatX c P 
and Y C Q. We ow that for n > ob we have XN A” = Xn, Y O An = Yn, with Xn and 

Yn given by Pog ana Arguing by induction, w iger n = 1. Then 
XQAACPAA= X. a a a a oe 
Consequently PAO A C X. Thus X N A =X, For n > 2, we have Zn C YX N An 
by the induction hypothesis. Thus by ead Zn C (XUY) Cc A”. This implies 
that Zn S£ CX 4A” and ZaNQ CY € A”. Conversely, let z € (X UY) € A”. 
Then by hae z = yx for some y € Y, ð By the induction hypothesis, y € Y; 
and x € Xn; for i = |y|. In view of fi aan we have z € Z,. This shows that 
(XUY)NA”" C Zn. Hence X N A” CZ,NP and YN A" CZ,NQ. e 

To prove the existence of a bisection, we consider the pair (X,Y) given in (M.Z,11 


We proceed, jn several steps. Define Zı = A and set Z = Un>1Zn gw of ( 
and pont we have Z = X UY. Observe first that by formula fear 


YXUA=XUY. (11.2.12) 


Clearly RETE implies YX C X UY. By induction, we obtain 
Y*X* c X*UY*. (11.2.13) 
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Next, we have 
A* = X*Y*. (11.2.14) 

Indeed, let w € A*. Since A C Z, the word w has at least one factorization w = 
2122°*+ 2, with zj E€ Z. Choose such a factorization with n minimal. Then we cannot 

ave 21,6 Y, zi+1 E X for some 1 < i < n—1, since this would imply that z;2;41 € Z by 
fie contradicting the minimality of n. Consequently there is some j € {1,...,n} 
such that 21,..., zj; € X and 2j;41,...,2n € Y, showing that w € X*Y*. 

Now we prove that X* is suffix-closed. For this, it suffices to show that 


uv E X > ve xX”. (11.2.15) 


Indeed, assuming HET. consider a word L= rs € X*. Then r = r'u,s = vs’ for 
some r’,s’ € X* and wv € X U1. By (ZI). X* and consequently, s € X*, 
opang that X* is suffix-closed. We prove (1.2.15) by induction on the let 
x = uv. Clearly the formula holds for |z| = 1. Assume |z| > 2. Then by (MaRi 
z = yızı for some yı € Y,xzı € X. If yı is not a letter, then again by ( 
Yı = Y2x2 for some yo € Y, xq E€ X. Iterating this operation, we obtain a factorization 
T = YpUR ++ L221 with Yg E Y,NA and z1,..., £k E X. 

Each proper suffix v of x has the form v = vp%p_1--- 21 for some suffix vp of £p and 

< Sek By the induction hypothesis, vp E€ X*. Consequently v € X*. This proves 
i pase An analoguous proof shows that Y* is prefix closed. 

Next we claim that 


X*nY* = {i}, (11.2.16) 


and prove this claim by induction, showing that X* N Y* contains no word of length 
n > 1. This holds for n = 1 because X N Y = Ø. Assume that for some w € A”, there 
are two factorizations © = 11%2°--XLp = Y1Y2'': Yq With x; E€ X, yj € Y. Since Y™ is 
prefix closed, we have xı € Y*. Since X* is suffix closed yg E€ X*. Thus zı © XN Y* 
and yg E€ X* NY. By the induction hypothesis this is impossible if x; and yq are shorter 
than w. swe have p = q = 1. But then w € X QY =9, a contradiction. This 
proves (i ae Now we prove that X is prefix. For this, we show by induction on 
n > 1 that no word in X of length n has a proper prefix in X. This clearly holds for 
n=l. 

Consider uv € X N A” with n > 2 and suppose that u € X. In view of (far, 
we have uv = yx for some y € Y,x € X. The word u cannot be a prefix of y, since 
tl ise u would be in X N Y* because Y* is prefix closed and this is impossible by 
(2.10) y Thys there is a word u’ € AT such that u = yu’, u'v = z. 

By 2T , u! € X*. Moreover |z| < n. By the induction hypothesis, the equation 
x = u'v implies v = 1. Thus u = w, showing the claim for n. Consequently X is 


ea A similar proof shows that Y is suffix. 4 

We now are able to show that (X,Y) is a bisection. Equation BE shows that 
any word in A* admits a factorization. To show uniqueness, assume that xy = x'y' for 
x,x' € X* and y,y' € Y*. Suppose |z| > |z2’|. Then xz = z'u and uy = y' for some word 
u. i i is suffix closed and Y* is prefix closed, we have u € X* MN Y*. Thus u = 1 


by ( . Consequently x = x’ and y = y’. since X and Y are codes, this completes 
the proof. 7 n 
Theorem ‘Shows that the following method allows us to construct all bisections. 
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(i) Partition the alphabet A into two subsets X; and Yj. 

(ii) For each n > 2, partition the set Zn = Ujz1n — 1Y;Xn—; into two subsets X, 

and Yn. 

(iii) Set X = Un>1Xn and Y = Un>1 Yn. 

In other words, it is possible to construct the components of the partition (P,Q) 
progressively during the computation. A convenient way to represent the computations 
is to display the words in X and Y in two columns when they are obtained. This is 
illustrated by 


EXAMPLE 2.3 Let A = {a,b}. We construct a bisection of A* by distributing iteratively 
the products yx (x € X,y € Y) into two columns as shown in Fig. . All the 
remainding products are put into the set R. This gives a defining equation for R, since 
from AUYX = XUY and X = {a,ba} UR we obtain R = {b,b?a}RU b?a{a, ba}. 
Thus R = {b,b?a}*b?a{a, ba} or also R = (bob a)*p?b* afa, ba}. Consequently X = 
(b?b*a)*{a, ba}, which is the code of Example E 


X Y 

1 a b 
2 ba 

3 bba 
S4 R 


Figure 11.3: A bisection of A”. 


The following convention will be used for the rest of this section. Given a code X 
over A, a pair (U,V) of subsets of A* will be called a bisection of X* if 


a 


To fit into the ordinary definition of bisection, it suffices to consider a coding morphism 
for X. 
A trisection of A* is triple (X,Y, Z) of subsets of At, which form a factorization of 
A*. 
A SX YZ (11.2.17) 


We shall prove the following result which gives a relationship between bisections and 
trisections. 


THEOREM 2.5 Let (X,Y,Z) be a trisection of A*. There exist a bisection (U,V) of Y* 
and a bisection (X', Z') of A* such that (X,U) is a bisection of X™ and (V,Z) is a 
bisection of Z™, 


Before giving the proof we establish some useful formulas. 
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PROPOSITION 2.6 Let (X,Y,Z) be a trisection of A*. 
1. The set X*Y* is suffiz-closed and the set Y*Z* is prefix-closed. 
2. One has the inclusions 


Y*X* c X UY*Z*, (11.2.18) 
OY" C BOY (11.2.19) 


3. The codes X,Y and Z are (2,0)-, (1,1)-, and (0, 2)-limited, respectively. 


Proof. We first prove 1. Let w € X*Y%*, and let v be a suffix of w (see Fig. E. 
Then w = uv for some u. Set v = xyz with x € X*, y € Y*, and z € Z*. Set also 
uxy = x'y'2' with x’ € X*, y' € Y* and z’ € Z*. Then 


w = wv = ugzyz = x'y' (zz). 


O O O 
! u i v 
ò 
' x y z 
—_ l —— 
x y z 
Figure 11.4: The set X*Y™ is suffix-closed 7_11 
oo : o 
oo =i zZ 
Py 
1X” y” 
oOo——— oo 
Figure 11.5: Y*X* C X* UY*Z* 7_12 


Uniqueness of factorization implies z’ = z = 1. This shows that v € X*Y* and pr VES 
that X*Y* is suffix-closed. Likewise Y*Z* is prefix-closed. We now verify GE 
Let x € X* and y € Y*. Set yx = x'y'z' with xv’ € X*, y! € Y*, and z € Z*. If 2’ = 1, 
then yx € Y*Z*. Thus assume that x’ 41. The word x’ cannot be a prefix of y since 
y € Y*Z* and Y*Z* is prefix-closed X*Y*Z* = {1}. Thus there is a word u such 
that x’ = yu and x = uy’z’ (see Fig. . since u is a suffix of x’ € X*Y%, it is itself in 
X*Y*. Thus u = 2”y" for some x” € X* and y” € Y*. Thus g = 2"y"y'z'. Uniqueness 


of factorization implies y” = y'= z’ = 1. Consequently yx = x'y'z! = a! € X*. This 
proves SRE Formula BET is proved symmetrically. 

The code X is (2, 0)-limited. Indeed, let u, v, w € AT be words such that uv, vw € X*. 
Since v and w are suffixes of words in X* and since X*Y* is suffix closed, both v and 
w are in X*Y*. Thus 

v=2Y, w= ry 
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for some x, x’ € X*, y,y’ € Y* (see Fig. ITO). The word y'x is a suffix of uvz € X*. 
Thus by the same argument, y’x is in X*Y* and consequently y'x = xy” for some 
x" € X* and y” € Y*. Thus vw = g'z"y"y. Since by assumption vw € X*, uniqueness 
of factorization implies that y” = y = 1. Thus w = 2 € X*. This proves that X is 
(2,0)-limited. Likewise Z is (0, 2)-limited. 

To show that Y is (1, 1)-limited consider words u,v, w € A* such that ww, vw € Y*. 
Then v € X*Y* because v is a suffix of the word wv in X*Y* and also v € Y*Z* as a left 
factor of the word vw in Y*Z*. Thus v € X*Y* 1 Y*Z*. Uniqueness of factorization 
implies that v € Y*. This completes the proof. n 


Figure 11.6: The code X is (2,0)-limited 
-5.5 
Proof of Theorem Eset S = {s € Y* | sX* C X*Y*}. First, we observe that 
S = {s € Y* | sX* c X*UY*}. (11.2.20) 


Indeed, consider a word s € Y*. If sX* C X*UY*, then clearly, sX* C X*Y*. Assume 
ony sely that sX* C X*Y*. Since s € Y* we have sX* C Y*X* and it follows by 
(ETH that sX* C X*UY*Z*. Thus sX* C APY 1 (X*UY*Z*) = CY ox) U 
(X*Y*NY*Z*) = X*UY* by uniqueness of factorization. This proves ( . Next, 
S is a submonoid. Indeed, 1 € S and if s,t € S, then stX* C sX*Y* C X*Y*. We 
show that the monoid S, considered as a monoid on the alphabet Y satisfies condition 
C(1,0). In other words, s,t € Y* and st € S imply t € S. Indeed, consider x € X*. 
since tz is a suffix of stx € X*Y™* and since X*Y* is suffix closed, tr € X*Y*. Thus 
t € S. This shows that S is a free submonoid of Y* generated by some (1, 0)-limited 
oe C Y+. Note that U is (1,0)-limited as a code over Y. According to Proposition 
, the code U is the left factor of some bisection (U,V) of Y*, with V* = Y* —UY*. 
We shall give another definition of V. For this, set 


R= {r € Y* | rX* n Z* #0}. 
Clearly RNS = 1. We prove that 
R*=V*. (11.2.21) 


First,we show that R C V*.Let r € R— {1}. Set y = st with s € Yt, te Y*. Since 
r € R, we have stx € Z* for some x € X*. By award we have tx € X* U Y*Z*. If 
ta € Y*Z*, then st € Yt Z* which is impossible since stz € Z*. Consequently ta € X*. 
Thus s € R. Since RNS = {1}, it follows that s € S. Thus no prefix s € Y* of r is in 
S. In other words, no left factor of r is in the code U. This proves that r is in V*. 
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Second, we prove that V* C R*. We proceed by induction on the length of words in 
V, the case of the empty word being trivial. Let v € Vt . Since ( yi oe factorization, 
we have U* N V* = {1}. Consequently v € U* = S. Thus by , there is some 


x € X* such that vz € X* U Y*. Since v € Y*, we have by (1.2.18) ux € Y*Z*, and 
by a previous remark even vx € Y*Z* . Set vz = yz with y € Y*, z € Zt. Then z 
cannot be a suffix of x, since otherwise z would be in X*Y* N Z*, which is impossible. 
Thus there is some word w € AT such that 


V = YW, WT = Z. 


since w is a suffix of v € X*Y*, we have w € X*Y*. Similarly w is a prefix of z € Y* Z*. 
Thus w € Y*Z*. Uniqueness of factorization implies w € Y*. The word y is in V*. 
Indeed, y € Y* is a prefix of v, and since V* is prefix closed as a subset of Y*, y € V*. 
since |y| < |v|, we have y € R* by the induction hypothesis. On the other hand, 


w E€ Y* and wz = z € Z* imply w € R. Thus v = yw € R*. This completes the proof 
of ( . Up to now, we have proved that 


A*=X*U*V"*Z" (1.2.22) 


with Y* =U*V*, S = U* and R* = V*. To finish the proof, it suffices to show that 
ucts M = X*U* and N = V*Z* are submonoids. Indeed, since the product 
(Go is ona gugus, we have M = X*U* and N = V*Z* whence A*= MN. 
By Proposition BY the monoids M and N then are free and their bases constitute the 
cone bisection (X', Y’). To show that X*U* is a submonoid it suffices to show that 
* X* U n Tins: let us consider words z € X* and s € U* = S. Then by 
sx E€ X*UY*. But sx € Y* implies sx € S because sxX* C sX* C X*UY*. 
Thus sx C X* US, showing that X*U* is a submonoid. Finally we show that V*Z* is 
a submonoid. For this, we show that 


ZRCRUZ. (11.2.23) 


This will aera Z* R* C R*UZ* which in turn proves the in Ge of (tray 


To show ( , let z € Z* andr € R. since r € Y*, formula implies that 
zr € Z* U X*Y*. Next, by definition of R, rx € Z* for some x € X*. Thus zrg € Z*. 
since Y* Z* is prefix-closed, we have z € Y*Z*. Thus uniqueness of factorization shows 
that zr € Z* z s If zr € Y*, then zr € R, since zrx € Z*. Thus zr € Z* UR and 


this proves ( m 


+5,6 
Theorem shows that all trisections can be built by “pasting” together quadrisec- 
tions obtained by a sequence of bisections. The following example shows that it is false 
for any trisection to be obtained by two bisections. 


EXAMPLE 2.4 Let A = {a,b}. The suffix code Z’ = {b, ba, ba?} is (0, 1)-limited. Thus 
Z' is the right factor of the blsection (X', Z’) of A* with X’* = A* — A* Z'. The equation 


ZX! +A=Z'+X! 
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derived from (tay gives A — Z’ = (1—Z')X' whence X’ = Z"(A —Z’). It follows 
that 
X’ = Z"(a—ba—ba*) 
a — aad eee Z"ba)a 
a0 ieee. \-Z"b-Z"™ba)a 
“ba” )a. 


Thus 
X' = Z"*ba U {a}. 


Next define 
U = (ba)*ba®, V=ba, Z= {b,ba?}(ba)*. 


The pair (V, Z) is clearly a bisection of Z’™*. Moreover, by inspection U C X’. This 
inclusion shows that, over the alphabet X’, the set U is the right factor of the bisection 
(X,U) of X™ with X = U*(X' — U). Moreover, U*V* = {ba, ba®}*. Then setting 


Y = {ba, ba*}, 
(U,V) is a bisection of Y*. Thus we have obtained 
A*=X X"*Z"* = XUY ae — Ga ie ae 


and (X,Y, Z) is a trisection of A*. Neither X*Y* nor Y*Z* is a submonoid. Indeed, 
ba € Y anda € X (since a € X’—U). However, ba? € Z and consequently ba? € X*Y*. 
Similarly b € Z and ba? € Y but b?a? € X whence b?a® c Y*Z*. This means that the 
trisection (X, Y, Z) cannot be obtained by two bisections. 


3 Exercises 


section7bis.1 
Section 


1.1 Let A = {1,2,...,n} and for j € A, let Xj = j{j + 1,...,n}*. Show that the 
family (X;)i<j<n is a factorization of A*. 


1.2 Let y: A* — R be a morphism into the additive monoid . For r € R, let 
= {v € AT | (v) =rlv|}, Br = Cr — (UsarCs)A™. 
Show that the family (B,-)-er (with the usual order on R is a factorization of A*. 


1.3 The (left) standard factorization of a Lyndon word w € L — A is defined as the 
pair 
m(w) = (l,m) 
of words in A* such that w = Im and £ is the longest proper prefix of w that is in L. 
Show that m € L and £ <x m. 
Show that if r(w) = (l,m) and (£) = (p,q), then p x1 < m. 
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1.4 Show that the set L of Lyndon words over A is a Lazard set. 
Hint: Set DM A” = {2}, z2,..., Zk} with z% < z2 <--- < zk. Show that z; € Zi for 
1<i<k where 


Z, = A, 


Ziv 


Show that Z; contains all z, such that 1(z,) = (zs, 2) with s <i<r. 


: section7bis.2 
Section 


2.1 Show that if a factorization A* = X}X*_,---Xf is obtained by a composition of 


n<*n-1~ 
bisections, then X; is a (i — 1,n — i)-limited code. 


2.2 Let X be a (2,0)-limited code over A. Let M C A* be the submonoid generated 
by the suffixes of words in X. Show that M is right unitary. Let U be the prefix code 
generating M. Show that there exists a bisection of A* of the form (U, Z). Show that 
X, considered as a code over U is (1,0)-limited. Derive from this a trisection (X, Y, Z) 
of A*. This shows that any (2,0)-limited code is a left factor of some trisection. 


2.3 Let A = {a,b,c,d,e, f,g} and let Y = {d, eb, fa, ged, dac}. Show that Y is (1, 1)- 
limited. Show that there is no trisection of A* of the form (X,Y, Z). 


2.4 Let y € At be an unbordered word. Show that there exists a trisection of A* of 
the form (X,y,Z). Show that a prefix (resp., a suffix) of y is in Z* (resp., X*). (Hint: 
First construct a bisection (X’, Z) of A* such that X” is the submonoid generated by 
the suffixes of y.) 


4 Notes 
The notion of a factorization has a introduced by Bchiitzenberger| ({L965a)) in the 


paper where he proves Theorem he factorizations of free a are very 
closely related with decompositions in direct sums of free Lie algebras. This topic 
is not mentioned here. A co eje treatment can be found in (L978) and in 
(1983). Proposition RoE special case of a statement known as the Baker- 
Campbell-Hausdorff formula (see, e.g., ({L983))). The notion of a Lazard set is 
due to (K979). A series of examples of other factorizations and a bibliography 
on this field can be found in (98 3a Finite factorizations were studied by 

eee Nica and Viennot. Theorem [2 a is from ({L965a)). Theorem 

is due to [Viennot] (L974). [Viennot] (L974) contains other results on finite factor- 


izations. Among them, th G ig a necessary and sufficient condition in terms of the 
construction of Theorem or the factors of a bisection to be recognizable. He also 
a 4 construction of trisections analoguous to that of bisections given in Theorem 


-4.2 
factorization of Tt Fae auc to Spitzer (see (1L983)). Exercises 
"AR are from [Viennot] (1974) 
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Chapter XII 


The Polynomial of a Code 


0 Introduction 


There is a noncommutative polynomial canonically associated with a finite code: it is 
the sum of the codewords, minus 1. When the code is maximal, this polynomial has 
some striking factorization properties, which reflect probabilistic and combinatorial 
properties of the code, such as the property of being prefix, suffix or synchronizing. 
When the code is prefix, the factorization is directly related to the tree representation 
of the code. When the code is bifix, one ha: even more combinatorial data explaining 
the factorization, as explained in Chapter hc general case, the factorization 
of the polynomial has no direct combinatorial interpretation, but is related via the 
factorization conjecture to a kind of coset decomposition of the free monoid with respect 
to the submonoid generated by the code. The factorization conjecture is the main open 
problem in the theory of codes. 


The loos organized as follows. In Section i ae pate factorizations. In 
Section 2, we state the esa theorem (Theorem , which is the main result 
of this chapter. Section § presents some results on noncom utative, polynomials which 
are used in the proof of the factorization theorem. Section fess the proof of the 
theorem. Section Fy presents some ap lications of the factorization theorem. These 
results already appear in Chapter ut can be easil leriyed using the factorization 
theorem (although only for finite codes). Section pf mtroduces another equivalence, 
called the commutative equivalence. It is conjectured that any finite maximal code is 
commutatively equivalent to a prefix code. It is shown that this is a consequence of the 
factorization conject re. in the sense that any factorizing maximal code is commuta- 
tively prefix. Section FFs a specialized topic concerning the reducility property of 
the linear representation associated to an automaton. We prove that the minimal rep- 


resentation associated with the submonoid generated y3 mą imal code is completely 
Es and FS 


reducible if and only if the code is bifix (Theorems 
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1 Positive factorizations 


Let X be a subset of At. A pair (P, S) of subsets of A* is called a positive factorization 
for the set X if each word w € A* factorizes uniquely into 


w = sxp (12.1.1) 


is.1 
with p€ P, s € S, x E€ X*. In terms of formal power series, (Petty can be expressed 
as 


A* =8X*P (12.1.2) 


Note the analogy with the coset decomposition of a grou jth respect to a subgroup. 
Observe that 1 € P and 1 € S. Taking the inverses in eae obtain 


1-X =P(1-A)s (12.1.3) 


or also 


X —1=PAS — PS (12.1.4) 
This equation shows that each word in X can be written in at least one way as 
z = pas 


withpeEe P,acA,sEeS. 


PROPOSITION 1.1 A set X for which there is a positive factorization (P, S) is a code. 


is.4 
Proof. Indeed, (tS tay implies that A* = S(X_)*P which in turn shows that (X )* 
has only coefficients 0 or 1. . 


A code X is positively factorizing if there exists a pair (P, S) of sets which is a positive 
factorization for X. 

A prefix code X is positively factorizing. Indeed, let P = A* — X A* be the set 
of words having no prefixes in X. Then A* = X*P and thus (P, {1}) is a positive 
factorization for X. Conversely, if (P,{1}) is a positive factorization for X, then the 
code X is prefix. Indeed, if u,uv € X*, then setting v = xp with x € X* and p €E P, 
we obtain (ux)p € X*, which implies p = 1 by the uniqueness of factorization. Thus 
X* is right unitary. 

Symmetrically, for a suffix code X, one has A* = S X * with S = A* — A*X. 

If X is a bifix code, then simultaneously 


A*=X*P and A*=SX* 


with P = A* — X A* and S = A* — A*X. This shows in Poe gpi there may exist 


several positive factorizations for a code (see also Exercise 
Recall that for a thin maximal bifix code X, we have 


X -1=d(X)(A-1)+(A-1T(A-D. 
Hence A* = X*P = SX* with 


PHadx) 4-17. S=d09 474-71), (12.1.5) 


Version 22 janvier 2008 J. Berstel, D. Perrin and C. Reutenauer 


eq8.Obis.1 


eq8.0bis.2 


eq8.0bis.3 


eq8.0bis.4 
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Let X C At be a positively factorizing code and let (P, S) be a positive, fact, riza ion 
for X. If P and S are thin, then X is a thin maximal code. Indeed, Eq. ws 
that X C PAS. Since P, A, S are thin, the product PAS is thin also and consequently 
X is thin. Furthermore, X is complete. Indeed, let u € F(S) and v € F(P). For each 
w in A* the word uwv is in SX*P. By the choice of u and v, it follows that w is in 
F(X*). Thus X is complete. 

As a special case, note that if P and S are finite, then X is a finite maximal code. 
We shall see later that, conversely, if (P, S) is a positive factorisation for a finite max- 
imal code, then P and S are finite. There exist finite godes which are not positively 
factorizing. An example will be given in Section 6. However, no finite maximal code is 
known which is not factorizing. Whether any finite maximal code is factorizing is still 


unknown. This constitutes the torigation conjecture. 
Observe that in view of ( e composition of positively factorizing codes is 
again a positively factorizing code. 
EXAMPLE 1.1 Let A = {a,b}, and let 
X = {a‘, ab, abaf, abab, aba? ba? , aba*ba, aba? ba? , aba?b?, aba?b?a?, b, ba? }. 


The set X is a positively factorizing code. Indeed, an easy computation gives 


1-X = (1 +a+ aba’ (1 +a+b))(1-— a- b)(1 +a’). (12.1.6) 


Thus this is a positive factorization (P, S) with 
P = {1,a,aba?, aba?,aba*b}, S= {1,07}. 


Since P and S are finite, X is a maximal code. We may verify that X is indecomposable. 
This is the smallest known example of a fini gmaximal inde Qmpogable code which is 
neither prefix nor suffix (see Example ices and Exercise foe-lbs. t 

The rest of this example illustrates the relation between the positive factorization and 
the structure of the transition monoid of an unambiguous automaton. The computation 
allows, in some cases as the present one, to recover the positive factorization directly 
from the monoid. 


. Ls a . ig-automateCesari 
An unambiguous automaton A recognizing X* is represented on Figure i 1S 
automaton can be used as follows to recover the positive factorization for X given 


by ( j We first compute the deterministic automaton obtained by applying the 

determinization algorithm to the automaton A starting from {1}. The result is shown 
i =determinization . HUN 

on Figure EF This automaton has a unique minimal strongly connected component 


corresponding to the rows of the elements of the minimal ideal of the monoid M = 
palA*). We then apply the determinization algorithm backwards to_th¢ automaton 


. i . terminization 
A starting also from state {1}. The result is shown on Figure B5 (we represent 


only part of the result, containing the unique minimal strongly connected component). 


Let L he the set. of states of the strongly conected component of the automaton of beens: 
. =determinization . =codeterminization 
Figure and le e the set of states strongly connected component of Figure BS 
Any ele nt ol L intersects any element of C in exactly one element, as shown on 
Table in which the elements of L appear as the columns an the elements of C 
as the rows (this is true for any thin maximal code, see Exercise . We select the 
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Figure 12.1: The automaton A fig-automateCesari 


Figure 12.2: The result of the determinization fig-determinizatio 


Cae tyes 
6,7, 10 


Figure 12.3: The result of the backwards determinization fig-codeterminizat 


state l = {1,9} in L and the state c = {1,2,7,8,10} in C . The set of labels of simple 
paths from £ to 1 is S = {1,aa} and the sets of labels of simple paths from 1 to c is 
P = {1,a, abaa, abaaa, abaab}. Since there all paths from {1,9} to {1,2,7,8,10} pass 
through state 1, the pair (P, S) is a positive factorization for X. 
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[5 | 6/3] 4] 10] 9] 4/3 | 
pif 2 | 7] 8] 10] 1/2 fi] 


jif6]7{4] 10] 1/41 | 
5] 2] 3] 8] 10] 9] 2/3 | 


Table 12.1: The minimal ideal of M 


2 The factorization theorem 


Recall that for a subset X of A*, X denotes its characteristic series. If X is finite, then ae 
X isa polynomial. The degree of a finite maximal code has been defined in Section K ia 
The following theorem is the main result of this chapter. 


THEOREM 2.1 Let X C A* be a finite maximal code and d its degree. Then for some 
polynomials P,Q,S in Z(A), one has 


X-1=P(d(A-1) + (A-1)Q(A-1))5. (12.2.1) 
Moreover, if X is prefix (resp. suffix), one can choose S = 1 (resp. P = 1). 


Note that in all known cases, the polynomial Q has nonnegative coefficients, and more- 
over P,S have coefficients 0,1. Thus, P and S can be view as representing sets of 
prefixes and suffixes. The polynomial Q is not, in general, a characteristic polynomial. 
However, in all known cases, for a finite maximal prefix code, one has Q = yoy U,, 
where U; is prefix closed. However, none of these is known to hold in general. 


ex8.1bis.1| EXAMPLE 2.1 Let 


X = {a?,a?ba?, a?bab, a2b?, aba? , aba?ba? , aba* bab, aba?b?, ababa? , abababa?, (ab)*, 


ababab? , abab?, ab?a, ab3a?, ab? aba”, ab®abab, ab? ab”, abt, ba, b? a”, b? aba”, b? abab, bab”, b3} 


5 
ae the maxi aal prefix code of degree 3 of Example Heg We have, in agreement with 
eorem 


X —1= (1+ ab)(3(A -1)+ (A -1)Q(A - 1)) 


with A = {a,b} and Q = 2 +a +b + ba + (1 + b)ab(1 +a). This can be check directly 
or observing that one has 


X = (1+ab)(a® +a*b(a? +ab+b) +abab(a? +b) +ba+b?a(a+b(a? +ab+b)) +b?) + (ab)* 
We have Q = Se U; with Ui = 0, U> = 1 and U3 = {1, a, ab, aba, b, ba, bab, baba}. 
COROLLARY 2.2 For any finite maximal code X over A, there exist polynomials P, S 


in Z(A) such that 
X-1=P(A-1)S. (12.2.2) 
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Observe that expression BES with P, S having coefficients 0, 1 defines a factorization 
for X, in the sense defined previously. 

The previous result has the followi g cpnverse. Thus finite maximal codes are com- 
pletely characterized by Corollary pat? 


THEOREM 2.3 Let W be a polynomial N(A) without constant term, and let P,S be 
polynomials in C(A) such that 


W-1=P(A-1)S. 


Then W is the characteristic polynomial of a finite maximal code X. If moreover S 
(resp. P) is constant, then X is a prefix (resp. suffix) code. 


Proof. Since W — 1 = P(A — 1)S and since W has no constant term, P and S are 
invertible in C((A)), and we obtain 

A* = SW*P. (12.2.3) 
Denote by supp(T) the support of the series T € C((A)), that is, the set of words having 
a nonzero coefficient in T. 

Define X = supp(W). Then X is finite. We show that X is complete. Indeed, 
let w be any word, and choose f length > deg(S),deg(P). Then uwu appears in 
the left-hand side of Equation eee and we obtain uwu = smp, for some words 
s € supp($), m € X*, p € supp(P). By the choice of u, it follows that w is a factor of 
m. Thus X* N A*wA* is not empty, and X is complete. 

Now we show that 7(X) = 1, where ņ iss me Bernoulli distribution. This implies 
that X is a maximal code by Theorem re É 

Since X is complete and finite, we have 7(X) > 1, by Proposition tet On the other 
hand, we extend 7 naturally to an algebra morphism C(A) — C, and we obtain 


m(W) —1=2(P)n(A — 1)r(S) = 0 


and thus 7(W) = 1. Next, since W has coefficients in N, one has 7(X) < 7(W) = 1, 
and therefore 7(X) = 1. E 

If S is a constant, we may suppose that S = 1 and Equation (za becomes 
A* = W*P. jmilar argument as before shows that X is right complete. Thus by 
Theorem MB., X is a prefix code. 7 


3 Noncommutative polynomials 


Let K be a commutative ring. We begin with a result on division of polynomials 
which is a version of Euclidian division in several noncommutative variables. Given 
two polynomials X,Y in K(A), we say that Y is weak left divisor of X in K(A) if there 
exist polynomials Q, R in K(A) such that 


X = YQ +R with deg(R) < deg(Y). 
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The polynomial R is called the remainder. Observe that in one variable, this relation 
is just Euclidian division. Weak left division is not always possible if A has more than 
one letter (for instance take X = a and Y = b for distinct letters a, b). 

The next result gives a sufficient co dition for the existence (this condition is easily 
seen to be also necessary, see Exercise BaP 


THEOREM 3.1 Let K be a field. Let X,Y, P,Q be polynomials in K(A) with deg(Q) < 
deg(P). If Y is a weak left divisor of XP +Q, then Y is a weak left divisor of X. 


The following consequence is immediate. 


COROLLARY 3.2 If X,Y,X',Y' are nonzero polynomials such that XY' = Y X', then 
Y is a weak left divisor of X and X is a weak left divisor of Y. n 


We fix an order on A and use the corresponding radix order on A*. Given a nonzero 
polynomial P we denote by max(P) the mazimal word (with respect to the radix 
order) appearing in the support of P. One checks easily that max(P + Q) = max(P) 
if deg(Q) < deg(P), and max(PQ) = max(P) max(Q). 


2A 
Proof of Theorem Bar tet Q’ and R’ be polynomials such that 


XP+Q=YQ' +R (12.3.1) 


with deg(R’) < deg(Y). We have Y # 0 since deg(R’) < deg(Y). We may assume 
deg(Y) > 1, since the case deg(Y) = 0 is immediate. The case deg(X) < deg(Y) 
is also easy. So we may assume deg(X) > deg(Y) > 1. Observe that deg(Q) < 
deg(P) < deg(X q deg(R’) < deg(Y) < deg(X) < deg( XP). This show that Q’ 
is nonzero. By Ke we have max(XP) = max(XP + Q — R’) = max(YQ’), and 
max(X)max(P) = max(Y) max(Q’). Hence the word max(Y) is a prefix of max(X) 
and we may write max(X) = max(Y)u for some u *, Hence for some a € K, we 
have X = X’+aYu, with max(X’) < max(X). By E we obtain 


X'P+Q=Y(Q-auP)+R’. 


We conclude by induction on max( X’) that Y is a weak left divisor of X’ and thus of 
X. 7 


Let 71, 2%2,... be a sequence of elements of a ring, of length at least n. We define the n- 
th continuant polynomial relative to this sequence by p(x1,...,%n), where p(z£1,..., £n) 
is the 1,1 coefficient of the matrix 


xı 1 rw 1 [En 1 
1 0 1 0 1 0/` 
It is a simple exercise to show that this matrix is actually equal to 


Ga janes . (12.3.2) 


Dice Ry) p(T2,...,£n—1) 
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Indeed, for the entry in position 2,1 for example, one sees that it is p(£2,..., £n) by 
computing the product of the first matrix by the product of the remaining ones and 
using induction. 

For sake of coherence, the 0-th.c nfinuant polynomial is equal to 1, and the (—1)-th 
is equal to 0. From Equation (esse one deduces that 


Pisces ia n) SP iy eres ie i 8) (12.3.3) 


and 


P(T1,- -- , En) = T1Pp(T2, . - - , En) + P(T3,..-, Fn). 


We often use the latter equation in the form 


Pln,- --, £1) =a iss sa) + PlEn-2,..., £1). (12.3.4) 


By induction, one deduces the Wedderburn relation: 
PT yey abs aL) SP is ey age) (12.3.5) 


3 4 
To prove it, use Equation (Ssh for the left-hand side, Equation (say for the 
right-hand side and induction. 


The next result shows that, ip essence, each relation X Y’ = YX’ in K(A) comes 
from a Wedderburn relation iR 


THEOREM 3.3 Let X,Y, X', Y’ be nonzero polynomials in K(A) such that XY' = YX’. 
Then there exist n > 1 and polynomials U, V, £1,..., £n such that 


X = Up(z1,..., £n), Y’ = p(£n-1,-.., £1) V 
Y = Up(z1,...,£n-1), X’ =p(£n,...,£1)V. 


Furthermore, £1,...,£n—ı have positive degree, and if deg(X) > deg(Y), then £n also 
has positive degree. 


The proof is a simple noncommutative version of t e,uclidean algorithm, obtained 
by iteration of the Euclidean division of Corollary B-2 


Proof. The hypothesis and Corollary Bo iply that Y is a weak left divisor of X. 
Thus X = YQ + Z, for some polynomials Q and Z with deg(Z) < deg(Y); note that if 
deg(X) > deg(Y), then deg(Q) > 0. From XY’ = YX’, we have (YQ + Z)Y’ = YX". 
We set Z’ = X’— QY’. This implies ZY’ = YZ’; since deg(Z) < deg(Y), we deduce 
that deg(Z’') < deg(Y’). Note that Z = 0 <= Z’ = 0. In this case, the result follows 
with n = 1, U = Y, zı = Q and V = Y’. We now assume that Z Æ 0. 

Then we have YZ’ = ZY’, and by induction, there exist polynomials U,V,z,...,2n 
such that 


Y = Up(ti;3 tn); A =p(£n-1,---, £1) V 
Z = Up(z1,...,£n-1), Y' =p(£n,..., £1)V. 
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Moreover, £1,...,£n—1 have positive degrees and, since deg(Z) < deg(Y), £n also has 
positive degree. This, together with X = YQ + Z and X’ = QY’ + Z' gives 


X = U(p(z1,...,£n)Q + p(21,...,2n-1)), Y’ =plan,...,21)V 


Y =p Giger eal X' = (Qp(an,...,%1) + p(4n—1,---,21))V . 
3 4 

The result follows by (EEE and3 with %n41 = Q (recall that Q has positive 

degree if deg(X) > deg(Y)). n 


We shall also need the next result in the proof of Theorem Bith A-—1 playing the 
role of the polynomial of degree 1). For polynomials X, X’, Y we write X = X’ modulo 
Y if Y is a weak left divisor of X, X’ with the same remainder, that is if X = YQ + R 
and X’ = YQ +R. 


THEOREM 3.4 Let B be a polynomial of degree 1, and let x1,...,2n be polynomials such 
that £1,...,£n—1 have positive degree. If B is a weak left divisor of p(£n—1,..., £1) and 
PlEn,..., £1) then p(£1,..., £1) = plti, ..., £1) modulo B for each i =1,...,n . 


To prove this, we need a lemma. 


LEMMA 3.5 Let £z1,..., £n be polynomials. 


(i) p(z1,..., £n) =O if and only if p(£n,..., £1) = 0. 
(ii If the degrees of £1,...,€n—1 are strictly positive, then 1, p(x1),...,p(£n—1,..-, £1) 
have strictly increasing degrees. 


Proof. (i) is proved using the Wedderburn relatioy it Pl tie ty 252500) and 
p(x1,...,%n—1) are both nonzero, and Equations (2-5-3 and eet if they are both 
zero (by induction, jf,one is zero, so is the other). Similarly (ii) is proved by induction, 
using Equation BST n 


-2.4 ‘ ‘ : A z 
Proof of Theorem Bathe proof is by induction. The case n = 1 is evident, so assume 


n>. 
+225 
If p(£n—1,-- et) vanishe h n p(t, ..+;%n—1) also vanishes by Lemma Ba (J. Then 
by Equation (S57 and ( , we have p(z1,..., £n) = p(£1,...,€n—2) and p(£n,..., £1) = 


Suppose that p(£n-1,..., £1) # 0. Then by 


’ 


p(@n—2,---,21). Thus we conclude the proof by i uçtion in this case. 
( 


LnP(Ln—1,++-, 01) + p(tn_2,---,21) = BQ+a 


for some polyno ial Q and some scalar a € K. 
By Lenna EAP yo have deg(p(£n-2,...,£1)) < deg(p(£n-1,..-.,£z1)). Accord- 


ingly, by Theorem , the above equality implies that B is a weak left divisor of 
£n. Hence, £n = y modulo B. By hypothesis, the left division of p(x1,...,2;) and 
p(@i,-..,%1) by B have the same remainder ted 6; fori < n — 1. Since B has 
degree 1, yand all the 6; are scalars. Thus ws implies that 


P(Ti, -++52n) = ony + ôn-2 
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and (say implies 


This proves the claim. n 


p(an,---,21) = Yon-1 + ôn—2. 


We consider now polynomials over Z and Q. A nonzero polynomial P € Z(A) is 
called primitive if the greatest common divisor of its coefficients is 1. The content of a 
nonzero P € Q(A) is the unique nonzero c(P) € Q4 such that P/c(P) is primitive; the 
latter polynomial is then denoted by P. Hence P = c(P)P. Actually, P is the unique 
primitive polynomial such that P = qP for some nonzero q € Q4. 

The next result is the analogue for noncommutative polynomials of Gauss’ lemma. 


LEMMA 3.6 (i) If P,Q in Z(A) are primitive, then so is PQ. 
(ii) If P,Q are in Q(A), then c(PQ) = c(P)c(Q) and PQ = PQ. 


Proof. For (i), if PQ is not primitive, some prime number p divides all its coefficients. 
One obtains a contradiction by reducing coefficients in Z/pZ, since polynomials over a 
field do not have zero divisors. Now (ii) follows easily from (i). n 


In the proof of the next statements, the exponent in the expressions like PQT'R 
refers to the inverse in the ring of series, and not to the residual. 


THEOREM 3.7 Let P,Q, R be nonzero polynomials in Z(A) with (Q,1) Æ 0. Then 
PQ™!R is a polynomial if and only if there exist polynomials P',S,T,Q! in Z(A) such 
that P = P'S, Q=TS, R=TR. 


Proof. The condition is of course sufficient. Conversely, we begin by proving the 
corresponding statement with Z replaced by Q. Then we use Gauss’ lemma to lift our 
conclusion to Z(A). 

1. Consider the set E of pairs of polynomials V = (V1, V2) such that Vj = PQ7!Vs. 
Clearly E is aright Q(A)-module. Note that E contains the pairs (P,Q) and (PQ7!R, R). 
Note also that if the constant term of the second component of V = (V1, V2) € E 
is zero, then Va~! = (Vja~!,Voa7! is in E. Indeed, since (V2,1) = 0, we have 
(PQ-'V2)a~! = (PQ7)(V2a-") and thus PQ~!(Vaa~!) = Via~+. Choose V = (Vj, V2) 
to be nonzero in E and of minimal degree, where deg(V) is the maximum degree of 
its two components. Note that Vi, V2 4 0 since otherwise V = 0. Suppose that the 
constant term of Vz is zero. Let a be a letter such that Vja~! Æ 0. This exists because 
VY, # 0. Then the pair (Viat, Vza71) is in E and has degree less than V. This shows 
that the constant term of Vz is nonzero. 

We show that F = VQ(A). For this, we show by induction on deg(W) that every 
W = (Wi, W2) is of the form W = VT for some poynomial T. We may assume 
that deg(W) > deg(V). If W has constant term zero, then W; = X „e4 (Wia™t)a for 
i = 1,2. Each pair Wa7! = (W,a7!,Woa') is in E by the remark above, and by 
induction Wa~! is in VQ(A). Thus W is in VQ(A). This shows the property when W 
has constant term zero. 

Otherwise since every W = (W1, W2) in E satisfies (W1,1) = 7(W2,1) with y = 
(PQ-1,1), one has (V2,1) Æ 0 and (W2,1) = a(Vo,1) with a = (W2,1)/(V2,1). It 
follows that (Wi,1) = y(We2,1) = ya(V2,1) = a(Vı,1). This shows that the pair 
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W — aV = (W1 — aVj, W2 — aV2) has zero constant term. Using the above argument, 
we have W — aV in VQ(A) and thus W in VQ(A). 

Since (P,Q) and (PQ7'R, R) are in VQ(A), there exists polynomials S and R’ such 
that P = VS, Q = V2S and PQ-!R=V,R’, R = VoR’. This concludes this part with 
P' = Vi and T= V2. 

2. By the first, part, we have P = P'S, Q =TS, R=TR' with P',S,T, R' € Q(A). 
By Lemma pe have c(P) = c(P’)c(S), c(Q) = c(T)c(S), c(R) = c(T)c(R’). Since 
P,Q, R are in Z(A), their contents are in N. Now PQ7'R = P'R' is a polynomial and 
c(PQ-!R) = c(P’)c(R’). From the above, one has c(P)c(R) = c(P’)c(S)c(T)c(R’) = 
c(PQ-'R)c(Q). Since the four factors are integers, there exist factorizations 


c(P) = p's, c(R) =r't, c(PQ-'R) = p'r', c(Q) = st 
for integers p’,s,r’,t. This implies that 
P=)'P'sS, Q =tTs5, R=tTr'R' 


whence the result, since the polynomials p’P’,s5,tT,r’R’ have integral coefficients. 


We shall also need the following result. 


LEMMA 3.8 Let B be a primitive polynomial of degree 1 which vanishes for some integer 
values of the variables. Let P,Q € Z(A) such that B is a weak left divisor of PQ in 
Z(A) with nonnull remainder a. Then B is a weak left divisor, in Z(A), of P with 
remainder B and of Q with remainder y, where Gy = a. 


Proof. Set PQ = BQ’ +a for some aS Z(A) anda € Z, a #0. Since Q Æ 0 (because 
a # 0), we may apply Theorem B.I. Consequently, P = BT + 6, T € Q(A), 8 E€ Q. 
Thus BQ' +a = PQ + BTQ. We have 8 # 0 (since a 4 0, and deg(B) = 1). Hence 
Q = y + BS for some S € Q/A), and y € Q, with a = By. Now, the assumption on 
B and the fact that P,Q € Z(A) imply that 8,y € Z. Since BT = P — G, we obtain 
by Gauss’ lemma c(B)c(T) = c(P — 8) € N, hence c(T) € N, because B is primitive. 
Thus T € Z(A). Similarly we obtain S € Z(A). n 


Finally we prove the following lemma which will be used later. 


LEMMA 3.9 Ifai,..., an € Q(A), then p(ai,...,an) and p(an,...,a1) are both zero or 
have the same content. 


Proof. By induction on n. Recall the Wedderburn relation 
plai, ...,an)plan—1,...,01) = plai, ...,an—-1)Plan,...,a1). 


Assume p(a1,...,@n) = 0. By the Wedderburn relatio „either plai, ...,@n-1) = 0 or 
plan, ...,a1) = 0. If p(ai,...,an—-1) = 0, then by E one has p(a1,...,an-2) = 0. 
By jndugtion, this implies p(an_1,...,@1) = 0 and p(an_2,...,a1) = 0 which implies 
by Be Wig ...,a1) = 0. 
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also p(Qn—1g-4-, 41) = 0 by induction. By , c(p(ai,... an) = c(p(ai, ...,an—2) 
and by (at c(p(an,---,@1) = C(p(@n—2,---,@1)). The conclusion follows by induc- 
tion. Otherwise Gauss’ Lemma and the Wedderburn relation give 


Assume now p(a1,..., an) #0 and p(an,....4 )a# 0. If p(ai,...,@n—1) = 0, we have 
(a 


c(plai,...,an))c(plan-1,...,a1)) = c(plai,...,an—-1))c(plan,...,a1)). 
By induction, c(p(a1,...,@n—1)) = c(plan-1,...,a1)) and thus we obtain the conclu- 
sion. 7 
4 Proof of the factorization theorem 


Given a word u and a series T € Z((A)), the residual of T by u is defined by 


uo TS > (T, uw)w. 


we A* 


This is consistent with the definition given in Chapter Fr “Observe that (uv)-!T 
v ‘(u+T). The notation Tv! is defined symmetrically. Note that u~'(Tv~') = 
(u-'T)v—!. Here, the exponent refers to the residual and not to the inverse. 

Given a code X and words u,v, we define S(u) = {s € A* | us = z1- £n, £i € 
X,|s| < |£n|} and P(v) = {p € A* | pu = 21 --- tpn, £i E€ X,|p| < |xı|}. These are the 
sets C,(u) and C;(v) of right and left contexts of u and v already defined earlier. 


LEMMA 4.1 Let X be a finite code. For each pair of words u,v, there exists a finite set 
F(u, v) such that 
u1X*v-l = $(u)X*P(v) + E(u, 0). (12.4.1) 


Proof. The series u~!X*v7! is the characteristic series of the set W of words w such 
that uwv € X*. Let F(u,v) be the set of words w such that uwv = xyz for some words 
z,z E€ X* and y € X with z a prefix of u, z a suffix of v and w a proper factor of y. 
Since X is finite, this set is finite. 

Let us verify that W is the disjoint union of S(u)X*P(v) and F(u,v). Indeed, 
S(u)X*P(v) and F'(u, v) are contained in W. They are disjoint since if w € S(u)X*P(v)N 
F(u, v), then uwv has two distinct factorizations £1£2--- £n with x; € X, one in which 
w is a proper factor of some x; and the other in which it is not. 

Conversely, given a word w such that uwv = z1 :--£n, with z; E€ X, either there is 
an index i such that x; = swp with 71---2;_,u' = u, and v = V'£i+1 En, and both 
u’, v’ nonempty. In this case, WE F(u,v). Otherwise, w € S(u)X*P(v). 

This proves Equation (ear 


LEMMA 4.2 Let X be a finite maximal code of degree d. Then there exist words 


Ui,- .., Ud; V1,-.-,Uq with uz,v1 E X*, such that for any 1 <i,j7 <d: 
Ho —-ly*,-l _ —] y*,,—1 
A= Sa Xe = u a: 
1<é<d 1<k<d 
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Proof. Let A = (Q,1,1) be an unambiguous automaton recognizing X*, set Y = YA 
and let M = y,(A*) be the transition monoid of A. Let G be an H-class of the minimal 
ideal of M that meets y(X*), and let e be its neutral element. The set H = GN y(X*) 
is a subgroup of index d of G. In particular, e € y(X*) and y!(e) C X*. 


Let w1,...,Ug, V1,- .-, Uq be words in y~1(G) such that 
G= U p(vj)H = U Ho(u;). 
1<i<d 1<j<d 


We may assume that y(ui) = y(v1) = e, and that y(u;) is the inverse of y(v;) in G. 
It follows that u1, v1 E€ p-l(e) C X*. Fix j, 1 <j <d. Let w € A*. Observe that 
p(vj;) E G, hence that ep(wv;) = ep(w)p(v;) = ev(w)y(v; Je is in eMe = G. Thus 
ey(wv;) is in some y(v;)H, for some uniquely determined i, depending on w. We show 
that: 
ey(wr;) E€ p(u)H & uw; E€ X*. 

Indeed, ey(wv;) E plu)H & v(uj)ey(wr;) E€ g(u)p(ui)H & v(uwr;) €E Ho 
u;wvj E X* (since y(u;wvj) = ep(uzwu;)e € G). 

Thus we obtain that for any w in A*, there is a unique 7 such that w € u Xto’, 
which implies the second equality in the lemma and the first by symmetry. n 


The following lemma is easily derived. 


LEMMA 4.3 Let X be a finite maximal code of degree d. There exist finite sets P, S C 
A* Pi, 5, C A* with 1 € Pi, Sı and Lı, Rı C At and a polynomial Q with coefficients 
in N such that 
(i) dd*=Q4+SX°P. 
(i) AX =L1+SX*P) = Bi +SiX°P. 
(iii) If S1 = {1} (resp. Pi = {1}), then X is prefix (resp.suffix). Conversely, if X is 
prefix (resp.suffiz), then one can chose Sı = {1} (resp.P, = {1}). 


23.2 : P . 
Proof. According to Lemma E tiere exist words u4,...,Ud,U1,---,Uq With u1, v1 in 


X* such that 
x o —-ly*,-l _ —-ly*,-—l1 
As Soi ee Ss SS a ao 
1<é<d 1<k<d 


3.1 
By Lemma E 


where S(u;), P(v;), F(ui,v;) are finite sets. Thus, for any i, j =1,...,d, 


At = SY) Su) X*P(w) + J, E(u, ve) (12.4.2) 


u Xo = S lu) X P (vj) + F (ui, v9) 


1<t<d 1<t<d 
= X S(up)X*P(oj)+ XO Elur v). 
1<k<d 1<k<d 


.2 
Let P = Ui<ceca P (ve) and S = Ui<cgca S (uk). Observe that, by BETA the unions 
are disjoint and therefore 
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Let Py = P(a,), S1 = S(ui). Let Ly = Ui<i<a F(u; vi), Ri = Ui<j<a F(u1, vj) which 
are again disjoint unions and finally Q = ; gaar t vj). 

Summing up both sides of Equation ( or i = 1,...,d, one gets assertion (i). 
Assertion (ii) is a rephrasing of the equations of i = 1 resp. j =1. 

Since uj, v1 € X*, one has 1 € S(u1) and 1 € P(vı). By (ii), we have (£,,1) + 
(SX*Pı,1) = 1. Since 1 € S and 1 € P, this implies 1 ¢ Lı. This finishes the 
verification of the properties of the finite sets. 

It remains to prove (ii). 

If X is prefix, then X* is right unitary. Thus the set of right contexts Sı = S (u1) is 
reduced to the empty word. 

Conversely, if S; = {1 ; pe ae A* = Rı +X*P. We show that X is right complete 
and hence, by Theorem MB., that X is a prefix code. Indeed, let w be a word, and let 
u be a word longer than any word in Ry and in P. The word wu is not in Rj, there it 
is in X*P. Consequently, w is a prefix of a word in X*. This completes the proof. n 


.1.1 
Proof o heorem PF For convenience, we set B = 1 — A. With the notation of 
Lemma #3) one has A* = Lı +S X*P,. Thus $X*P, = B-'(1— BL). Hence 


BS X* a8 Po: 


.3 
By Lemma EOR we have d— BQ = BS X*P. Thus, replacing BS X* yields d— BQ = 
(1-— BL,)P B This implies 


P = Pı(1— BL)! (d — BQ). 


We apply Theorem Britto the last equality and we obtain the existence of E, F, G, H 
in Z(A) s ch ghat Pı = EF, 1 — BL, = GF, d— BQ = GH, P = EH. Thus 
Lemma Bae lies that G = +1 (we write P = a as a shorthand for saying that a is 
the remainder of the weak left division of P by B). Replacing | PA E, F,G, H 
by their negatives, we may suppose that G = 1. Then Lemma again implies that 
H = d. Thus 


P = E(d+ BI) (12.4.3) 


for some I € Z(A), 
By Lemma i), we have B~!(1 — BR) = A* — Ry = S1 X* P. Hence 


1—-X=P(1-—BR,)'1BS,. 


This is very close to Equation BEN, but wit, â çentral inverted polynomial, which 
we must eliminate. For this we use Theorem again. There exist J, K, L, M in Z(A) 
such that P = JK, 1— BR, = LK, BS; = LM, 1— X = JM. Let 7 be a positive 
Bernoulli morphism. It extends linearly to an algebra homomorphism Q(A) > R. 

We may assume that 7(K) > 0. Then we deduce from Lemma B.Y that K = 1+ BK’ 
and L = 1 + BL’ for some K', L’ in Z(A). Thus BS; = (1 + BL')M =M+BL'M, 
which implies that M = BM’ for some M’ in Z(A). Thus 


1—-X=JBM'. (12.4.4) 
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3 1 
Equation (ray will imply Equation EEN, if we show that J is of the form 
Jı(d + BJ2). This is the most technical part of the proof. It will follow from 


E(d+ BI) = J(+ BK’). (12.4.5) 


(which holds in view of EREIN the fact that P = JK and K = 1+ BK’) and 
from the divisibility property of Theorem e difficulty is that in this theorem, 
the polynomials involved have coefficients in Q. Therefore a lot of work is required to 
draw the conc ugign in Z. à 

Theorem applied to Equation (aay guarantees the existence of polynomials 
T1, --, Ln, U,V in Q(A) such that 


E = Up(z1,..., £n), d + BI = p(£n-1,...,£1)V, 
J = Up(£1,...,£n-1), 1 + BK' = p(an,...,21)V. 


We write pi, qi for p(x1,..., £i) and p(x;,...,21). We apply Theorem Britto the two 
equalities at the right, and obtai that Qn—1 and qn are both congruent to a scalar 
modulo B. Thus Theorem “implies that pn—ı1 and gn—1 (res -o Pn and qn) are 
congruent to the same scalar modulo B. Furthermore, by Lemma B.I, C(Pn—1) = C(dn—1) 
and c(pn) = clqn). 

Observe that 1 — BR, is primitive, since Rı has coefficients 0,1. Thus the equation 
1 — BR, = LK implies that L, K are primitive, since m are in Z(A). We have 
K = 1+ BK’ = mV, hence by Gauss’ lemma, c meV = c(K) = 1, and qV = 
K = K. This equality together with Lemma gt that V = e+ BV’, with 
V’ € Z(A) and € = +1. Now 1— X = JM and 1— me is primitive, hence J is primitive. 
Since JK = E(d+ BI), Gauss’ lemma again implies that d+ BI is sae ee 


d+ BI = qn_-iV, the same lemma implies that d+ BI = G,_1V. Lemma now 
implies that G,-1 = «d+ BN for some N € Z(A). 

We have seen that pn—-1 and gn; are congruent to the same scalar modulo B and 
that c(pn-1) = C(qn—1). Hence p,—1 and g,—1 are congruent to the same scalar modulo 
B, and we have pn_; = ed + BH with H € Q(A). But p,_1; — ed = BH and B is 
primitive, so that by Gauss’ lemma, c(H) = c(pn—1 — ed) is in Z and H is in Z(A). 

Now, J is primitive and J = U, a1 hence J = J = Upn_1, which implies J = 
U(ed+ BH). Thus Equation B implies 


1— X =U(ed+ BH)BM’. 


This implies that for some polynomials W, Y, Z in Z(A) (defined by W = +U, Y = +H, 
Z = +M’) and e; = +1, one has 


1- X =W(edB+BYB)Z, (12.4.6) 


with 7(W),7(Z) > 0. 

Now define the linear mapping à : Q(A) > R by A(w) = |w|z(w) for each word w in 
A*. It is easily sh that A(P1P2) = A(Pi)n(P2)+n(P1)A(L 2), for P 1, P2 inQ(A). 
Applying A to iind observing that A(B) = —1, we obtain A(X) = n(W)eıdr (Z). 
Since A(X) > 0, this shows that € = 1. 
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To conclude the proof o hegrem Ett observe that if X is prefix, then one can 
choose S1 = 1 by Lemma i); since BS; = LM and M = BM", we obtain that 
B = LBM'. Thus M’ = +1. Since 7(Z) > 0 and Z = +M, we deduce Z = L 

On the other hand, if X is suffix, one can choose Pı = 1 by Lemma SH again. 
Since Pı = EF, we obtain E = +1. Since E = Upp, we obtain by Gauss’ lemma, 
+1 = Üpn, hence W = +U = +1. Since 7(X) > 0, one has W = 1. n 


REMARK 4.4 A closer loo at, the previous proof proves the following claim: under the 
hypothesis of Theorem 2-1, one has 


X-1=W(d(A-1)+ (A-1)Y(A-1))Z, 
and moreover 
P,=W(14+(A-DW’), S$1=(14+2'(A-1))Z, 
for some polynomials W, Y, Z, W’, Z’ in Z(A), and in particular 
mW)=n(Pi), 1(Z) =7(S1). 
3.3, 
Recall that P 1,51 are as defined in Lemma fey aid its proof, and therefore satisfy: 
up X*=SiX*, X =X P: 


for some words u1, v1 in X*. Note that the average length ` uey 7(w)|w| of X is equal 
to A(X) = 1(W)dx(Z). or 

We prove the clai by going through the proof of Theorem 2 i we have P; = EF, 
F = 1 (by Lemma Rae. G = 1 and GF = 1), E = Up, (by Gauss’ lemma, since 
E = Uppy, i E being primitive since Pı is and Pı = EF); furthermore g, = +1 


(by Lemma B.¥, since mV = K = 1), which i pfigs, by an argument similar to that 
for pn—, and gn—; in the proof of Theorem | that p, = +1; thus we obtain that 
PnF = 1, and Pı = P} = Up,F, which is the product of +W with a polynomial 
which is = +1. Since 7(P1) > 0 and r(W) > 0, we obtain finally that P is of the 
desired form W(1+ (A — 1)W’). 

On the other hand, Z = +M’, M = BM’, BS; = (14+ BL')M. Thus BS, = 
(1+ BL')BM"', which implies that S$; = (1 + L/B)M’, and 7($1) = 2(M’). Since 
($1) >0 and 7(Z) > 0, we have in fact S1 = (1+ L’B)Z, which proves the claim. 


5 Applications 


Let m be a Bernoulli morphism. Recall that the average length (with respect to 7) of 
a finite code X is the number J „ex 7(w)|w|. Moreover, the measure 7 is said to be 
positive if m(w) > 0 for any word w. 


: . . . -3.4 
The following statement is easily obtained from Remark Et onein aie same 


result holds for arbitrary thin complete codes, as proved in Corollary 


COROLLARY 5.1 Let X be a finite maximal code and m be some positive Bernoulli 
measure. The average length of X is greater or equal to the degree of X, with equality 
if and only if X is bifix. 
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3.4 

Proof. With the notation of Remark 4, we have 7(W) = m(P1) and x(Z) = ($1). 

By Lemma “TUS 1) > 1 (resp. (P11) > 1), with equality if and only if Pı = 1 

(resp. S; = 1). Thus, since the average length of X is equal to A(X) = 1(W)dz(Z), 
we obtain that it is > d. 

i g $ 3.3 

If equality holds, then we must have Pı = Sı = 1. Then X is bifix by Lemma $ il). 


Let x be any word and X a finite code. Consider the set E(x) of pairs of words (u, v) 
such that uxv is in the submonoid X* generated by X; this set has a natural partial 
order defined by (u,v) < (u’,v’) if for some m1, m > Repas u’ = miu, v = vmo 
(so that u’mv’ = m,umumz2). Recall from Section ae of x with respect 
to X is an element in E(x) which is minimal for this order. In other words, a context 
of x is a pair (p,s) such that: either prs = c1 +--+ Cn, G E X, n > 1, with p a proper 
left factor of cı and s a proper right factor of cn, or prs = 1: for x = 1 (and also for 
x € X*) (1,1) is a context. Observe that the set of contexts of a word is finite. Its 
measure is by definition X` a(p)m(s), where the sum is over all i (p, 8) of x. 

The next result is eaily obtained with the help of heo em The same result holds 
for an arbitrary thin complete code (Theorem Ki ee 


COROLLARY 5.2 The measure of the set of contexts of a word with respect to a finite 
maximal code is equal to the average length of this code. 


We prove in fact a noncommutative version of this result. 
Proof Fix a finite maximal code C and a word x. Define a mapping e from Z((A)) into 
the complete tensor product Z((A)) 9z Z((A)), by e(w) = Voy U Qv. It is easily 
seen that e(A*) = A* @ A*. Furthermore, the very definition of a context implies that 
e(C*) = >7,,,C*p ® sC*, where the sum is extended to all contexts (p,s) of x with 
respect to C. “Thus e(C*) = (C* 8 1)X (1 @C*), where X = }>p®@ s, summed over all 
contexts of x. 

Suppose that x is nonempty; then we have for any words s,m, p: 


e(smp) = (s 8 1)e(m)(1 8 p) + e(s)(1 8 mp) + (sm 8 1)e(p) 
+ J, (suo @ (v'm)p+ s(mu~*) @ v*p) 
u,vAél,c=uv 


+ 5 (umv, x)su™t @ v!p, 
u,vÆl 


ction8.3 

where we use u~! in the same way as the notation recalled at the beginning of Section jess 
and where (,) is the scalar product on Z(A) that has A* as an othonormal basis. 

The proof of this formula follows by inspection, once the 6 possibilities for the word 
x to be a factor of the word smp have been observed: either x appears as a factor of 
m, or of s or p, or x is an overlapping factor of the product sm or mp, or finally x is 
factor of smp which starts properly in s and ends properly in p. 

Note that the previous formula is lin a ip each of s,m,p, so it extends to series 
S,M,P. Now we have by Corollary | AY = SC*P, where P,S are polynomials. 
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Hence we obtain 


A“ 9 A* = e(A*) = e(SC"P) 
= (S @ 1)e(C*)(1 & P) + e(S)(1 @ C*P) + (SC* @ 1)e(P) 
+ J (Sun) @ (vt C*)P + S(C*u-t) 8v P) 
u,vAl,c=uv 
+ > (uC*v,2)Su? @ v1 P. 
ujvAl 


Note that the last summand,is a finite sum, R, and that e(C*) =(C* @ 1) X(1 ec"). 
By the proof of Lemma f-I, where S(v) and P(u) are defined, we thus have 


A“ ® A* = (SC* @ 1)X(1 8 C*P) + e(S)(1 8 C*P) + (SC* @ 1)e(P) 
+ So (Sul @S(v)C*P+SC*P(u)@v'P) +R. 


u,v4l1,c=uv 


Let us multiply by PB & 1 on the left and by 1 @ BS on the right. Since PBS is the 
inverse of C* ,we obtain 


P@S=X+(PB®@l1)e(S)+e(P)(1 8 BS) 
+ S° (PB(Sué*) @ S(v) + P(u) @ (v'P)BS) 


u,vál,x=uv 


+ (PB Q 1)R(1 9 BS) 


Note that when x is the empty word, then formula for e(smp) has to be slightly mod- 
ified: the X’s are replaced by —s ® mp — sm & p, and from here on the argument is 
similar and hence we omit it. 

This shows that the sum of the contexts of the word x is equal to P & S modulo the 
two sided ideal of Z(A) ® Z(A) generated by A—1@1 and 18 (A-1). 

The homomorphism 7 ® 7 : Z(A) ® Z(A) — R vanishes on this ideal. Thus the 
measure of the set of contexts is equal to 7(P)7(S). Now, using C — 1 = P(A—1)S, 
we find that the average length of C is equal to A(C) = m(P)z(S). n 


on4.4 
A code of degree 1 is called synchronous, see Section N ig Tf Cis a finite set of words 
in A*, denote by C the sum in Z|A] of the commutative images of the words in C. 


COROLLARY 5.3 Let C be a finite maximal code on the alphabet A. Then C — 1 is a 


multiple of A—1. If the quotient of these two polynomials is irreducible in Z[A], then 
C has at least two of the following properties: prefix, suffix, synchronous. 


a 10 
REMARK 5.4 Note that the first assertion is already a consequence of Theorem fiers 


It implies the following nice result on the cardinality of the code: |C| = 1 mod.(|A|—1). 
In fact, with the notation of Corollary pati has |C| = 1+ psmod.(|A| — 1), where 
p, 8 denote the image of P, S under the homomorphism sending each letter onto 1 (and 
thus C onto |C|). Note that in the particular case where C is a finite prefix code, 
this equality on the cardinality has the following interpretation: ps is the number of 
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internal nodes of the |A|-ary tree representing C’, and the previous equality comes from 
the corresponding equalit relating the number of external and internal nodes of a 
rooted tree, see Exercice Fate previous equality also has some analogy with the 
Schreier formula asserting that if H is a subgroup of the free G group generated by A, 
then H is free of rank h, with h =1+|G/A|(|A| — 1). 


Proof of the Corollary. zà 

Let p the canonical homomorphism Z(A) — Z[A]. Then by Remark Far we have 
C-—1= p(X)p(Z)(d + p(Y)(A — 1))(A — 1), which proves the first assertion. If the 
quotient is irreducible, then we must have two of the three following equalities: p(X) = 
+1, p(Z) =+1,d+ p(Y)\(A- 1) =L 

The equality p(X) = +1 implies, by Remark 3.4, that 7(S1) = 1, hence Sı = 1, and 
then that C is prefix (Lemma vi)). We deal with the second equality similarly. 

If the third equality holds, then we must have p(Y) = 0, and d = +1, which implies 


d = 1, hence C is synchronous. n 


6 Commutative equivalence 


Recall that the canonical morphism that associates to a formal power series its com- 
mutative image is denoted by 


a: Q((A)) > QIMA] 


and that a(A*) = A® is the free commutative monoid on A. By definition, for each 
a € QU(A)) and w € A®, 


Two series 0,7 € Q((A)) are called commutatively equivalent if a(o) = a(r). 

Two subsets X and Y of A* are commutatively equivalent if their characteristic series 
X and Y are so, which means that a(X_) = a(Y_). In an equivalent manner, X and 
Y are commutatively equivalent if and only if there exists a bijection y : X — Y such 
that y(x) € a~ta(z) for all x € X. 

A subset X of A* is called commutatively prefix if there exists a prefix subset Y of 
A* which is commutatively equivalent to X. 


EXAMPLE 6.1 Any suffix code X is commutatively prefix (since X is prefix). More 
generally, any code obtained by a sequence of compositions of prefix and suffix codes 
is commutatively prefix. In particular, our friend X = {aa, ba, baa, bb, bba} is commu- 
tatively prefix. 


EXAMPLE 6.2 Let A = {a,b} and let 
X = {aa, ba, bb, abab, baab, bbab, ab? , a? ba? a>b?.ab, a®ba*b, a3babab} 
i hiş sef js gasily verified to be a code, by computing, for instance, the sets U; of Section 
U; = {abb, aba”, ab°ab, aba®b, (ab)? , ab}, Uz = {ab}, U3 = {ab} 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


st8.6.1ibis 


458 XII. The Polynomial of a Code 


Further, X is maximal since for 7(a) = 7(b) = 4, we obtain 7(X) = 1. Finally X is 
commutatively prefix since 


Y = {aa, ba, bb, abab, abba, abbb, abaab, aba“ , aba*b*, aba? ba”, aba? bab} 


is a prefix code commutatively equivalent to X. 
Observe that 


X-—1=(1+a+b+aľb+ aba) (a +b-— 1)(1 + ab) 


is a positive factorization for X. Actually, elongs, to the family of indecomposable 
finite maximal codes described in Exercise 


EXAMPLE 6.3 Let A = {a,b} and let X C a*ba*. Then X is commutatively prefix if 
and only if for all n > 1 
Card(X N ACHD) <n. (12.6.1) 


(recall that A"+) = 1U AU... UA”). The condition is necessary. Indeed, let Y 
be a prefix code commutatively equivalent to X. Since Y is prefix, the map 7 from 
XNA") to {0,1,...,n—1} defined by 1(a‘ba’) = i is injective. Thus we cannot have 
more than n words of length at most n in X. Conversely, suppose that the condition is 
satisfied. We show by induction on n > 1 that there is a prefix code Y commutatively 
equivalent to X1 U...U Xn with Xn = XNA”. This is true for n = 1. Assume that it is 
true for n > 1. Set I = {i > 0 | a'ba* NY Æ Ø}. Then Card(I) = Card(X n.A\+) and 
thus Card(I) + Card(X,41) < +1. Thus we can choose Z commutatively equivalent 
to Xn41 formed of words abaf with distinct indices i € {0,1,...,n}\ Z. The code 
Y UZ is prefix and commutatively equivalent to X1 U... U Xn41. 


THEOREM 6.1 For each subset X of A* the following conditions are equivalent: 


(i) X is commutatively prefix. 
(ii) The series (1 —a(X_))/(1—a(A)) has nonnegative coefficients. 


The proof uses the following lemma. 


LEMMA 6.2 Let U C A* and V € Z((A)) be such that (a(U_),w) > (a(V), w) > 0 for 
all w € A®. Then there exists U' C U such that a(U') = a(V). 


Proof Let w € A®. Since (U,a7!(w)) > (a(V),w) > 0, there exists a subset Uw of 
U Na™!(w) such that (a(Uw ), w) = (a(V),w). Then U’ = UweaeUw is a subset of U 
and (a(U"),w) = (a(V), w). = 


Proof of Theorem Bay ay = (ii). First assume that X is commutatively equivalent to 
some prefix set Y. Let P = A*—YA*. Then A* = Y*P, hence 1—Y = P(1—A). Thus 
1-a(X_) =a(PL)(1—a(A)). Clearly a(P ) = (l-a(X_))/(1—a(A)) has nonnegative 
integral coefficients. 

(ii)=> (i). Let X, = XNA” for n > 0. Set Q = (1 — X)A*. Then a(Q) = 
(1—a(X_))/(1—a(A )) has nonnegative coefficients. Note that, since Q(1- A) = 1-X, 
we have for 1 <i<n 

Qi = Qi1A — Xi (12.6.2) 
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where Q; is the homogeneous component of degree i of Q. 

We show by induction on n > 1 that there exists a prefix code Y commutatively 
equivalent to X1 U... U Xn. The property is true for n = 1 since Y = Xj, satisfies the 
condition. 

Suppose that the property is true for n > 1. Let P = A* \ YA*. Thus 1- Y = 


P(1—A). Set Y, = YN At and P; = PAN Æ fr 0 <i <n. Since 1—a(X) = 


a(Q)(1 — a(A)) and 1 — a(Y_) = a(P)(1 — a(A)) coincide up to degree n, we have 
a(Qi) = = a(P; i) for0 <i <n. Since Qn+1 = QnA —Xn41, the polynomial QnA —Qn+1 
has nonnegative coefficients. i implies that a(PaA)—a(Qn+1) also has nonnegative 
coefficients. Thus, by Lemma p.2, we can choose a subset P,,.1 of P „A in such a way 
that a(Pa1) = a(Qn41): 
We define Yn+1 = Ph A\ Pri. Then Y UYn+1 is prefix and commutatively equivalent 
to X,U...U Xn41. 


It is interestin to note the connexion of this statement with Kraft’s inequality [f 5.10 
(see Exercise -7 


COROLLARY 6.3 A factorizing code is commutatively prefix. 


Proof. Let X C At be a factorizing code and let (P,Q) be a factorization of X. 
Then by definition 1 — X = P(1—A)Q. Passing to commutative io. gives 
1—a(X_) =a(P)(1—a(A))a(Q ) or also (1 — a(X_))/(1—a(A)) = a(P a 


a(P )a(Q ) has nonnegative coefficients, the conclusion follows from eee 
Now we give an example of a code which is not commutatively prefix. 


bles. 1 
EXAMPLE 6.4 Let X C a*ba* be the set given in Table ff with the convention that 
a’ba) € X if and only if the entry (i D gontains a 1. Clearly X C A06) and Card(X) = 
16. Thus according to Example 6.3, X is not commutatively prefix. 
Let us show that X is a code with deciphering del . Let x,y,2,t E€ X be such 
that xy < zt. We may suppose x < z. Then (see Fig. jaa) we have 


x =a'ba),y = ařalba”, z = xa’ ,t = alba” 


bles. 1 
The 1’s representing x and z are in the same row in Table Fj Thus necessarily k € 
{0, 1,2, 4,6,7, 12,13,14} since these are the distances separating two 1’s in the same 
row. Next, the dots representing y and ¢ are in rows whose difference of indices is k. 


Thus k € {0,3,5,8, 11}. This pa gk = = 0 and thus z = z. 


Observe that, by Corollary e code X is not contained in a factorizing code. 


We have mentioned in Section 1 that all finite maximal codes known up to now are 
factorizing. A fortiori they are commutatively prefix. If ns KRUN holds for all finite 
maximal codes, then it follows that the code of Example cannot be completed into 
a finite maximal code. 

As a matter of fact, we do not known at present whether it is decidable that a finite 
code can be included, in a finite maximal code. 

We use Theorem to prove the following statement. 
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0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 


O;1 1 1 1 1 
1 

2 

3 | 1 1 1 1 
4 

6 

7 

8 | 1 1 1 1 
9 

10 

11/1 1 1 

12 

13 

14 

15 


Table 12.2: A Code X Which Is Not Commutatively Prefix 


Figure 12.4: If X where not a code 8_06 


THEOREM 6.4 A circular code is commutatively prefix. 
We first prove the following lemma. 
LEMMA 6.5 Let X C A? be a circular code. Then the series 
log a( A) — log a(X.*) 
has nonnegative coefficients. 


Proof We have, denoting by L the set of Lyndon words, and by L’ the set of Lyndon 
words whose conjugacy class meets X*, 


log A* — log X* = doe =A) — log(1 — X)“! 
Z pe AL 
E n n 
n>1 n>1 


Now, we have a(A)” = Donen Dipin po(@)? since the conjugacy class of x? has |x| = n/p 
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elements. And, since X is circular, a(X)” = diper a(x)”. Thus 


yetyy 


xEL p>1 EL! p>1 


=: 


reEL—-L'n>1 


log a(A*) — log a(X*) 


Thus the series s = log a(A*) — log a(X*) has nonnegative coefficients. n 


-6.3 
Proof of Theorem ince a(A*—X A*) is the exponential of s = log a(A*)—log a(X*), 
it also has nonnegative co cients. This is equivalent to the fact that X is commuta- 
tively prefix, by Theorem a n 


Note ‘eee greular code need not be cyclically equivalent to a prefix code (see 
Exercise : 

We now consider the problem of the commutative equivalence to synchronized codes. 
The period of a set of words is the greatest common divisor of the lengths of its elements. 


Two commutatively equivalent sets have the same period. If a finite ot atte — 


code X has period p, then X = Y o AP and thus d(X) = d(Y )p by Proposition XII} 
In particular, a finite maximal prefix code X of period p > 2 is not synchronized. The 
following result shows that this is the only obstruction. 


THEOREM 6.6 Any finite maximal prefix code of period 1 is commutatively equivalent 
to a synchronized prefix code. 


The proof relies on three lemmas. Since the only synchronized maximal prefix code 
on one letter is {a}, we may assume that the alphabet has at least two letters. 
We denote by P the homogeneous component of highest degree of a polynomial P. 


LEMMA 6.7 If X is a finite maximal prefix code of period p such that 
X—-1=L(A-1)R 

where R = A” for somen > 1, then R is a polynomial in A dividing 1+A+...+ APH. 

Proof. 1. Let E = (A — 1)R. We first show that E is a polynomial in A. Let us prove 


by descending induction on m < n that 


n+1 
E=E'+ X sið (12.6.3) 
i=m+1 


with deg(E’) < m. The property is true for m = n since EB = AR = AH, Suppose 
that it holds for m < n. Let g be in the support of L and let h be a word of length 
m. For all words k of length n — m + 1 we have ghk € supp( LE) C supp(X) and thus 


ghk € X. Since X is pre X and k # €, we have (LE, gh) = 0. 
( 


But, by Formula we have 
t—1 
(LE, gh) = (L, g)(E',h) + XC (L, gi)Sttm-i (12.6.4) 
i=0 
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here g; is the prefix of length 7 of g and t = \g|. Since (LE, gh) = 0, we deduce from 
SEI the formula 


t-1 
(E',h) = —(1/( (L 9) Sic (L ,gi) St+m—i 
i=0 


1 
It shows that (£’,h) does not depend on the word h and proves that (EADE is true 
for m — 1. Thus we have proved by induction that E is a polynomial in A, i.e. 


n+1 


E= 3 s; A’ 
i=0 


Thus R is also a polynomial in A. 
2. Let x be a word of X and let q = |z|. Let Z, s be the polynomials in the variable z 


n+1 


q 
(z) = y s(z) = > siz’ 
i=0 i=0 


where 4; is the coefficient in L of the prefix x; of length i of x. We have for each integer 
m such that 0<m<q 


(LE,2m)= Yoa 


i+j=m 


(we set s; = 0 for j > n+ 1). Suppose that 0 < m < q. Since X is prefix and 
X — 1 = LE, we have (LE, £m) = 0 and thus 


5 lisj =0 


i+j=m 
Since (LE, x) = 1 and (LE, €) = —1, we therefore have 


z1 — 1 = k(z)s(z). 


This shows that E divides AY — 1 and that R divides 1+ A+ ...-+ A171 for each q such 
that X contains a word of length q. This proves the lemma. n 


The second lemma is a simple property of commutative equivalence. For a nonempty 
finite set X of words, we denote by deg(X) the maximal length of the words of X and 
by X the set of words of X of length deg(X). 


LEMMA 6.8 Let Y be a maximal prefix code on the alphabet A with Y = AR and 
deg(R) =n. If R # A”, then Y is commutatively equivalent to a prefix code Y’ such 
that Y' is not of the form AR’ and, in particular Y' #4 Y. 


Proof We use an induction on n to prove in a first step that for a nonempty set R strictly 
included in A”, there exists a word h and letters a,b such that (ha)~!R 4 (hb)-!R 
(note that one of the sides can be the empty set). The property holds trivially for n = 0 
since then R is equal to {e} = A®. Assume, for some n > 1, that it holds for n — 1. 
If for some a € A, the set S = a~!R is nonempty and not equal to A”~!, there exists, 
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by induction hypothesis, a word g and letters b,c such that (gb)~'S 4 (gc) 1S. Then 
the assertion is proved with h = ag. Otherwise, we have a~!R = A”! or a™!R = Ø 
for each letter a. Since R Æ Ø and R 4 A”, the sets a~'R cannot be all equal. Thus, 
there exist letters a,b such that only one of the sets a~'R,b-!R is empty. Then the 
conclusion holds with h = 1. 

For h,a,b as above, let U = (ahb)~ 1Y, V = (bha) tY. Then U = (hb)~!R and 
V= (ha) 1R. Thus U 4 V. Let Y = W + ahbU + bhaV with the three terms of the 
sum disjoint. Then Y’ = W +ahbV +bhaU is commutatively equivalent to Y. Suppose 
that Y’ = AR’. Since V = (bha)~!Y, we have 


V = (bha) t? = (ha)~!R = (aha) tF = (aha) 'W = (aha) t?" = (ha)! R’. 
On the other hand, we have 
Ü = (bha) tF" = (ha)! R' 


and thus we obtain U = v, a contradiction. 


For a finite maximal prefix code X, we denote by e(X) the integer defined by 
e(X) = max{e > 0 | X — 1 = L(A — 1)R, e = deg(R)}. (12.6.5) 
LEMMA 6.9 Let X be a finite maximal prefix code such that 
X-1=L(A-1R (12.6.6) 


with deg(R) = n > 1 and R # A”. Then there exists a prefix code X' commutatively 
equivalent to X such that 
e(X") < e(X) 


3 “aA Ao gA 
Proof. We first note that (ogy implies that X = LAR. Therefore the homogeneous 


eq8.5.2bis 


polynomials L, R have 0,1 coefficients and are characteristic series of sets of W: a ae 


gE L and let Y = g |X. Then Y = AR. Since R # A”, there exists by Lemma 


prefix code Y’ commutatively equivalent to Y such that Y’ is not of the form wok Soe 


Let X’ be the prefix code commutatively equivalent to X defined by (see Figure 
X' = (X \gY)U gY”. 
In order to prove that e(X') < e(X), consider a factorization 
X'—1=L(A-1)R' (12.6.7) 


and suppose by contradiction that deg(R) < deg( R’). 

Since Y’ is commutatively equivalent to Y, we have deg(Y’) = deg(Y) and thus 
deg(X) = deg(X’). This implies that gY' c X' = LAR’. Consider a word y € P’. 
Then gy € L'AR' implies that gy = g'r with g' € L’ and r € AR. Since deg(L) > 
deg(L’), the word g’ is a prefix of g. Let g = g'h. Then Y! = gl X' =h AR’. 

Suppose first that h =, i.e. that g = g’. Then Y= AR’, a contradiction. 
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A A 


Figure 12.5: The codes X and X’ 


Thus h Æ e. Let a be the first letter of h and set h = ah’. Let b be a letter distinct 
from a (recall that the alphabet is supposed to have at least two elements). We have 


Y! =h AR = KR = (bH) AR = (gdh!) EAR = (goh. 
Since the words of X and X’ which do not begin by g are the same, this implies 
F' = (g'bh') t3 = (g'bh') LAR. 


Since deg(Y') = deg(Y), we have deg(Y’) = deg(R) + 1. Thus the equality f = 
(g’bh’)-!LAR with |g'bh’| = |g| = deg(L) implies Y’ = AR, which is a contradiction. 
| 


5.1 
We can now complete the proof of Theorem 6.0, We use an induction on the integer 


e(X). 
The property is true when e(X) = 0 since then X itself is synchronizing. Indeed, we 
consider the factorization 


LAS ae 


given by Theorem war that X is prefix. Then e(X) = 0 implies D = 0 and 
thus d= 1. 

When e(X) > 1 we have X —1 = L(A —1)R with deg(R) =n > 1. If R = A”, 
then by Lemma Bea I divides 1+A+---+ At with p the period of X. Hence, 
pe>nt+1> in, contradiction with the hypothesis p = 1. Therefore, R #4 A” and 
by Lemma Es there exists a prefix code X’ commutatively equivalent to X such that 
e(X') < e(X), whence the property by induction. 


EXAMPLE 6.5 Consider the maximal bifix code of degree 3 on the alphabet A = {a,b} 
X = aaa + aabA + ab + baa + babA + bba + bbb. 


We have X —1 = (A— ith R = 1+a+b+bA+abA. We choose, with the notation 
of the proof of Lem pase 1 and thus Y = X. We have R = abA. Then, with the 
notation of en e h = a, since (aa)~'R = @ and (ab)-'R = A. Thus 
we obtain 


X' = aaa + aab + ab + baa A + babA + bba + bbb. 


The code X’ is commutatively equivalent to X and synchronized since baab is a syn- 
chronizing word. 
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Figure 12.6: The codes X and X’ 


7 Complete reducibility 


Let A be an alphabet. Let o € Q((A)) be a series and let u € A* be a word. We define 
a series 0 - u by (o - u,v) = (a, uv) for all v € A*. The following formulas hold : 


a+ La; (o-u) -v =0- w. 


Let V, be the subspace of the vector space Q((A)) generated by the series o - u for 
u € A*. For each word w € A*, we denote by w,(w) the linear function from V, into 
itself (acting on the right) defined by 


polw): p> p- w. 


The formula (p- u)~o(w) = p : uw = ppo (uw) is straightforward. It follows that Yo 
is a morphism 


Wo : A* — End (V3) 


from A* into the monoid End(V,) of linear functions from V, into itself. The mor- 
phism Yo is called the syntactic representation of o. 


PROPOSITION 7.1 Let Y be a subset of A* and let o = Y. Let p be the canonical 
morphism from A* onto the syntactic monoid of Y. Then for all u,v € A*, 


plu) = pv) & Yolu) = Yo(v) 
In particular the monoid w,(A*) is isomorphic to the syntactic monoid of Y. 
Proof Assume first that Yo(u) = Yo(v). Then for all r € A*, 
o- ru = (0 - ryWpolu) = (o-r)vo(v) =o-rv. 
Thus also for all s € A*, 
(o, rus) = (o - ru, s) = (0 - rv, s) = (0, rvs). 


That means that rus € Y if and only if rvs € Y, which shows that y(u) = (v). 
Conversely, assume y(u) = (v). Since the vector space V, is generated by the series 
o-r (r € A*), it suffices to show that for r € A*, 


(0 -r)Wo(u) = (0 - r)bo(v) 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


466 XII. The Polynomial of a Code 


Now for all s € A*, 


((o-r)vo(u),s) = (o-ru,s) = (o,rus) 


The preceding result gives a relationship between the syntactic representation of the 
characteristic series o of a set Y C A* and the syntactic monoid of Y. It should be 
noted that the dimension of the vector space V, can be trichy less than the number 
of states of the minimal automaton of Y (see Example Fait ever, it can be shown 
a hs vector space V, has finite dimension if and only if Y is recognizable (Exercise 


EXAMPLE 7.1 Let o = A*. Then o - u =a for all u € A*. Consequently V, = Qo isa 
vector space of dimension 1. 


EXAMPLE 7.2 Let A = {a,b} and let X C A* be the bifix code of Fig. ipa 


Figure 12.7: 


Let o = X*. We shall see that the vectors 0,0 -a,o0-a’, and ø - b form a base of the 
vector space V,. Indeed, the formulas 


N 


o-ba=0-a 
o-b =0.ab=0-a+0-:a°—0o.b, 


show that the four vectors o,o -a,c -a° and o -b generate V,. A direct computation 
shows that they are linearly independent. The matrices of the linear mappings w,(b) 
in this base are 


0 1 0 0 0 0 0 1 
0 0 1 0 1 0 0 0 
Wo (a) —_ 1 0 0 0 7 WPolb) = 0 1 1 —1 
0 0 1 0 0 1 1 -1 
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The relation between 7, and the minimal automaton of X* is now to be shown. The 
minimal automaton has five states which may be written as 1,1-a,1-a?,1-b,1-0?. 
Let V be the Q-vector space formed of formal linear combinations of these five states. 
The linear function a : V — V, defined by a(1-u) = o- u satisfies the equality 
a(q:u) = a(q)-u and moreover we have a(1-a+1-a?—1-b—1-b?) =0 Thus V has 
dimension 5 and V, has dimension 4. 


Let V be a vector space over Q and let N be a submonoid of the monoid End(V) of 
linear functions from V into itself. The action of elements in End(V) will be written 
on the right. 

A subspace W of V is invariant under N if for p € W,n € N, we have pn € W. The 
submonoid N is called reducible if there exists a subspace W of V which is invariant 
under N and such that W 4 {0},W # V. Otherwise, N is called irreducible. 

The submonoid N is completely reducible if for any subspace W of V which is invariant 
under N, there exists a subspace W’ of V which is a supplementary space of W and 
invariant under N. 

If V has finite dimension, a completely reducible submonoid N of End(V) has the fol- 
lowing form. There exists a decomposition of V into a direct sum of invariant subspaces 
Wi, Wo,--> , Wk, 


V=W 98W29- Wk 


such that the restrictions of the elements of N to each of the W;’s form an irreducible 
submonoid of End(W;). In a base of V composed of bases of the subspaces W;, the 
matrix of an element n in N has a diagonal form by blocks, 


| | 


Let M be a monoid and let V be a vector space. A linear representation w of M 
over V is a morphism from M into the monoid End(V). A subspace W of V is called 
invariant under w if it is invariant under 7(M). Similarly ~(M) is called reducible, 
irreducible, or completely reducible if this holds for y(M) 

The syntactic representation of a series ø is an example of a linear representation of 
a free monoid. The aim of this section is to study cases where this representation is 
completely reducible. We recall that all the vector spaces considered here are over the 
field Q of rational numbers. The following result is a classical one. 


THEOREM 7.2 A linear representation of a finite group is completely reducible. 


Proof Let V be a vector space over Q. It suffices to show that each finite subgroup of 
the monoid End(V) is completely reducible. Let G be a finite subgroup of End(V) and 
let W be a subspace of V which is invariant under G. Let W1 be any supplementary 
space of W in V. Let 7: V — V be the linear function which associates to p € V the 
unique pı in W, such that p = pı + p’ with p’ € W. Then z(p) = 0 for all p € W and 
™(p) = p for p € Wy. Moreover, p — 7(p) E€ W for all pe V. 
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Let n = Card(G). Define a linear function 0: V — V by setting for p € V, 


o(p) = 1 X a(pg)g™. 
gEG 


Let W’ = 0(V). We shall see that W’ is an invariant subspace of V under G which 
is a supplementary space of W. First, for p € W, 


o(p) = 0. (12.7.1) 


Indeed, if p € W, then pg € W for all g € G since W is invariant under G. Thus 
™(pg) = 0 and consequently 6(p) = 0 Second, for p € V, 


p—O(p) € W. (12.7.2) 


Indeed i 1 
p—(p) =p- =) arlogg = =) log- (09) 97. 
gEG gEG 
By definition of 7 each pg — 7(pg) is in W for g € G 
also (pg = (pg))g* € W. This shows formula (2,72). 
By we have W C Ker(0) and by (2.7.2), Ker(0) C W since p € Ker(@) 
implies p — 0 (p) = p. Thus 


W = Ker(0) 


1 
ae (era further shows that 6? = 6. a, at — 6°(p) = 0(p — 9(p)). By 


, p—9(p) € W. Hence 0(p) — 07(p) = 0 by ( - Since 0? = 6, the subspaces 
W = Ker(0) and W’ = im(@) are supplementary. Finally, W” is invariant under G. 
Indeed, let p € V and h € G. Then 


1 = 
A(p)h = = Sa (0g)g *h. 
gEG 
The function g + k = h~'g is a bijection from G onto G and thus 
1 E 
A(p)h = = X n(phk)k™ = 0(ph). 
gEG 


This completes the proof. n 


THEOREM 7.3 Let X C A% be a thin bifix code. The syntactic representation of X* is 
completely reducible. 


[a2 
In the case of group codes, this theorem is a direct consequence of Theorem EF For the 
Ta case, we need the following proposition in order to be able to apply Theorem 


PROPOSITION 7.4 Let X C At be a thin prefix code and let y = Yy» be the syntactic 
representation of X*. The monoid M = w(A*) contains an idempotent e such that 
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(i) e € W(X"). 
(ii) The set eMe is the union of the finite group G(e) and of the element 0, 
provided 0 € M. 


Proof Let S be the syntactic monoid of X* and let p : A* — S be the canonical 
morphism. Consider also the minimal automaton A(X*) of X*. Since X is prefix, 
the gufomaton A(X*) has a single final state which is the initial state (Proposition 
Ba tet u = ~a(x~) be the morphism associated with A(X*). We claim that for all 
u,v E€ A*, 


plu) = wv) = Cu) = vr) (12.7.3) 
e404 
Indeed, in view of Proposition TEA have 


(u) = wv) = plu) = (v). 


and by Proposition a 
plu) = vv) & Yu) = ptr). 


Formula (erat shows that there exists an isomorphism (3 : u(A*) > v(A* zM 
defined by B o u = y. In particular, Y(X*) = B(u(X*)). By Theorem , the 
monoid M has a unique 0-minimal or minimal ideal, say J, according to whether M 
does or does not have a zero. There exists an idempotent e in J which is also in Y(X *). 
The H-class of this i mpotent is isomorphic to the group of X. 

Proof of Theorem or convenience, set V = Vy» a d_dgnote by w the syntactic 
representation wx». Let M = ~(A*). By Proposition aoe exists an idempotent 
e € w(X*) such that eMe is the union of 0 (if 0 € M) and of the group G(e). The 
element 0 of the monoid M corresponds to the zero of 7(A*). Let L = Me and define 
S = {pe | p E€ V}. Since e? = e, we have te = 7 for all r € S. Next, for all 1 € L, we 
have le = l since 1 = me for some m € M and consequently le = me? = me = l. Thus 
for alll € L, 


VIC S. (12.7.4) 


Let W be a subspace of V which is invariant under M. We shall see that there exists 
a supplementary space of W which is invariant under M. For this, set T = W N S and 
G = G(e) 

The group G acts on S. The subspace T of S is invariant under G. Indeed, let T L 
and let g € G. Then Tg /ipce W is invariant under M and tg € S by B 
since g = ge. By theorem [7.4, there exists a subspace T’ of S which is supplementary 
of T in S and which is invariant under G. Set 


W={peV|VWleL,pleT} 


We shall verify that W” is a supplementary space of W invariant under M. First observe 
that W” clearly is a subspace of V. Next it is invariant under M since for p € W’ and 
m € M, we have, for all l € L, (om)l = p(ml) € T’ and consequently pm € W”. 
Next we show that 
T cw’ (12.7.5) 


Indeed, let 7” € T’. Then 7’ € S and thus 7'e = 7’. Hence r’l = 7’el for all l € L. Since 
el € eMe and since T” is invariant under G, it follows that r'l € T’. Thus 7’ € W’. 
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Now we verify that V = W + W’. For this, set o = X * and first observe that 
oe =0 (12.7.6) 


(Note that o € V and e acts on V.) Indeed, let x € X* be such that y(x) = e. Since 
X* is right unitary, we have for all u € A* the Ws xu E€ X* sue X*. Thus 


(oe, u) = (o> alts z (o, xu) = (o, u) which proves ( 

In view of ( , we have ø € S. Since S = T + T’, there exists 7 € T and 7’ € T’ 
such that o = r +7’. Then for all m € M, Tm = rm. For each m € M, 
rm € Tm C Wm C W, whence rm € W. Using (t , also rm € T'm C W'm 
Since W’ is invariant under M, we obtain r'm € W’. Thus om € W + W”. Since V is 
generated by the vectors om for m € M, this proves that V = W + W”. 

Finally, we claim that W N W’ = {0}. Indeed, let p € WOW’. Then for all l € L, 


pl =0 (12.7.7) 


E le L. Then pl € W, since W is invariant under M and pl € S by Eq. 
. Thus pl € W N S = T. Further pl € T’ by the definition of W’ and by the 
fact that p € W’. Thus pl € TNT’ = {0}. 

Since V is generated by the series o-u (u € A*), there exist numbers a, € Q (u € A*) 
with only a finite number of them nonzero such that 


p= `> Qula- u). 


ucA* 


Again, let x € X* be such that y(x) = e. Since X* is left unitary, we have, as above, 
(o, w) = (o, wx) for all w € A*. Thus for all v € A*, 


(p, v) = `> Oye u,v) = `> Qula, uv) 


ucA* uc A* 
= `> aulo, wwz) = > Qula- u, VL) 
ucA* ue A* 


(p, vz) = (p- va, 1). 


Thus s ing m = p(v), we have (p, v) = (pme, 1),and since me € L, we have pme = 0 
by . Consequently (p, v) = 0 for all v € A*. Thus p = 0. This shows that 
W N W' = {0} and completes the proof. z 

EXAMPLE 7.2 (continued) The subspace W of V = V, generated by the vector p = 
o+o0-a+o-a? is invariant under Yo. Indeed we have 


p:a=p, p:b=p 


We shall exhibit a supplementary space of W invariant under w,. It is the subspace 
generated by 
fap 0—00, o—o0:b. 


Indeed, in the base 


2 
p, 7-0-a,a0-a-a°,o-0a-b, 
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the linear mappings w,(a) and w,(b) have the form 


1/0 o 0 
olı 1 0 

a=) ol rca = 
0/-1 1 0 


We can observe that there are no other n n trivial invariant subspaces. 
We now give a converse of Theorem or the case of complete ode. The result 
does not hold in general if the code is not complete (see Example 


THEOREM 7.5 Let X C At be a thin complete code. If the syntactic representation of 
X* is completely reducible, then X is bifiz. 


Proof Let A = (Q,1,1) be a trim unambiguous automaton recognizing X*. Let y be 
the associated representation and let M = y(A*). 

Set o = X * and also V = Vs, YW = Yo. Let u be 1g ganonical morphism from A* 
onto the syntactic monoid of X*. By Proposition et we have for u,v E€ A*, y(u) = 
(v) = u(u) = u(v). Thus we can define a linear representation 0: M — End(V) by 
setting for m € M, O(m) = u(u) where u € A* is any word such that y(u) = m. If w 
is completely irreducible, then this holds also for 6 . 

For notational ease, we shall write, for p € V and m € M, p-m instead of p-u , 
where u € A* is such that y(u) =m. With this notation, we have for m = y(u), 


p: u = pp(u) = p6(m) = p-m. 
Observe further that with m = y(u), 

(o -m,1) = (aa, 1) = (0, u). 
Hence 


if u € X*, 
otherwise 


(o-m1)={ 4 


Finally,we have for p € V, m,n E€ M, (p-m)-n=p-mn. For p€ V and for a finite 
subset K of M, we define 


(12.7.8) 


8 
In particular, (TAE gives 
(o - K,1) = Card(K N y(X*)). (12.7.9) 


The code X being thin and complete, the monoid M has a minimal ideal J that 
intersects p(X*) . Further, J is a D-class. Its R-classes (resp. L-e a, „are, athe 
minimal right ideals (resp., minimal left ideals) of M (see Chap. eC ; 

Let R be an R-class of J and let L be an L-class of J. Set H= RAL. For 
each m € M, the function h +> hm induces a bijection from H onto the H-class 
Hm = Lm R. Similarly, the function h +> mh induces a bijection from H onto the 
H-class mH = LA mR. 
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To show that X is suffix, consider the subspace W of V spanned by the series 


o-H-o-K (12.7.10) 


for all pairs H, K of H-classes of J contained is the same R-class. We shall first prove 
that W = {0}. 

The space W is invariant under M. Indeed, let H and K be two H-classes contai i Lows 
some R-class R of J. Then form € M, (o-H)-m = o- (Hm) since, by Proposition ise 
the right multiplication by m is a bijection from H onto Hm. Thus (o-H—-o-K)-m= 
o-(Hm) —o- (Km) and the right-hand side is in W since Hm, Km C R. 


Next for all pE W and me J, 
p-m=0 (12.7.11) 


Indeed, let H and K be two H-classes contained in an R-class R of J. Then for m € J, 
Hm,Km C RA Rm. Since RN Mm is an H-class, we have Hm = Km = RA Mm. 
Thus 

(o-H—-0o-K):-m=0. 


1 
Since p € W is a linear combination of series of the form Erto this proves ETT. 


Since the representation of M over V is completely reducible there exists a supple- 
mentary space W’ of which is invariant under M. Set o = p+)! with p € W, p' € W”. 
Let H, K be two H-classes of J contained in an R-class R. We shall prove that 


o:-H=0.K (12.7.12) 


We have 
c:-H—-0o.-K=(p-H-p:K)+(P-H-p-K). 


1 
Since p € W and H, K C J, it follows from Erny that 


p-H=p-K=0 (12.7.13) 


Next, there exists numbers am E Q (m € M) which almost all vanish such that 
P' = J em &m(0 : m). Since the left multiplication is a bijection on H-classes, we have 


(o-m)-H—-(o-m)-K =0.(mH)-—o. (mK). 


Thus, since mH, mK C mR, the righ-hand side is in W. Thus also p'- H — p' -K € W. 
Since W’ is invariant under M, this element is also in W’. Consequently it vanishes 


and 
p-H=p'-K. (12.7.14) 


T2712) fo ows from (T2-7,13 
In view of (T2.7.9), formula (E 
in J, then it, inter ects all H-classes which are in the R-class containing H. In view of 
Proposition g ea is equivalent to X being suffix. 
We conclude by showing that X is prefix. Let T be the subspace of V composed of 
the elements p € V such that (p - H,1) = (p - K,1) for all pairs H, K of H-classes of J 
contained in a same £-class. 
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The subspace T is invariant under M. Indeed if p € T and H, K C L, then for all 


m EM, 
(p-m): -H =p: mH, (p:-m):- K=p- mK. (12.7.15) 
Since mH, mK are in the £-class L, we have by definition ((p-m)-K,1) = ((p-m)-K, 1). 
Thus pem ET. 
Next for all m € J, and pE V, 


p-meT. (12.7.16) 


Indeed, let m € J and let H, K be two H-classes contained in the L-class L C J. Then 
mH = mK. Thus by , ((0:m)-H,1) = ((o-m)-K,1). Thus p-m eT. 
Let T’ be a supplementary space of T which is invariant under M. Again, set 


o=p+p 


this time with p € T, p' € T’. Let H, K be two H-classes in J both contained in some 
L-class L. Then 


(o: HA) (e+ K,1) = ((P - H,1) = (p - K,1)) + (œ - H,1) — (P - K,1). 


By definition of T, we have (p - H,1) — (p - K,1) = 0. In view of BETTS, we have 
P- H, p -K €T whence p’- H — ERS € TOT = {0}. Thus (o - H,1) = (a+ K,1). 
Interpreting this equality using Tam it is shown that if y(X*) meets some H-clas 
of J, it intersects all H-classes contained in the same L-class. By Proposition [VII4.8) 
this shows that X is prefix. 


8.8 


EXAMPLE 7.3 Let A = {a,b} and let X = {a, ba}. The code X is prefix but not suffix. 
It is not complete. 
Let o = X.*. The vectors o and a - b form a base of the vector space V, since 


The representation w, is irreducible. Indeed, 


volta) =| y oh w-vl 9 |, 


vo=| 5 oh wa-wa rl 


Thus the matrices Yo (u), u € A* generate the whole algebra Q?*?. Thus no nontrivial 
subspace of V is invariant under A*. 


7.5 
This example shows that Theorem oes not hold in general for codes which are not 
complete. 
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8 Exercises 


. ction8.1 
Section 


1.1 A code X C A? is called separating if there is a word x € X* such that each 
w € A* admits a factorization w = uv with ru, vx € X* 


(a) Show that a separating code is complete and synchronous. 
(b) Show that a separating code is factorizing and that its factorization is unique. 


1.2 Let X C At bea synchronous code and let A = (Q,1,1) be a trim unambiguous 


automaton recognizing X*. For x € X* let 
U(z)={peQ|1>p}, Vie) ={geQ|a> ]}. 


Show that X is separating if and only if there is a word x such that «A*x c X* and 
any path from a state in U(x) to a state in V(x) passes through state 1. 


1.3 Let X C At be a code. A pair (L, R) of subsets of A* is called a separating box 


for X if for any word w € A* there is a unique pair (£,r) € L x R such that w admits 
a factorization w = uv with fu, vr € X*. 
Show that a code which has a separating box is factorizing. 


2x08.0bis.1quatro| 1.4 Let X C At be asynchronous code and let A = (Q,1,1) be a trim unambiguous 
automaton recognizing X*. ForS,T C A*, let = es va(s)1s and c = Der Yalt)a1- 
Show that (5,7) is a separating box if and only if 


(i) for each w € A*, one has fy ,(w)c = 1. 
(ii) Any path from a state of £ to a state of c passes through state 1. 


1.5 Let b € A be a letter and let X C AT be a finite maximal code such that for all 
x € X, |a|y < 1. Let A’ = A — b. Let X’ = X N A”. Show that there is a factorization 
(P,Q) of X’ considered as a code over A’ such that 
X = X' U POQ. 
Obis.2 


8. 
1.6 Let A = {a,b}. Use Exercise o show that a finite code X C a* Ua*ba* is 


maximal if and only if X = a” U PbQ with n > 1 and P,Q C a* satisfying PQ = 
ltat---+a™!. 


1.7 Let X,Y C A* be two distinct finite maximal prefix codes such that X NY # 0. 
Let P = A* — X A*, Q = A* — Y A* and let 


Rc(xny)y* 


be a finite set satisfying uv € R,u € (XN Y)* =v € R. (This means that R is 
suffix-closed considered as a set on the alphabet X NY.) 


(a) Show that there is a unique finite code Z C A* such that 


Z-1=(XNY -1)R 
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(b) Show that there exists a unique finite maximal code T C AT such that 


T -1=(P +wQ)(A -1)R 


Ai] 


where w is a word of maximal length in Z. 
(c) Show that the code T is indecomposable under the following three assumptions: 
(i) Z is separating. 
(ii) Card(P U wQ) and Card(R) are prime numbers. 
(iii) R is not suffix-closed (over the alphabet A). 


.Obis.1i 
(d) Compare with Example (rit by taking P = {1,a}, Q = {1,a,b}, R = {Lae}, 
w = abaa. 


1.8 Let A = {a,b} and let 
X =(4#?-b)UBA, Y=A7avd. 


(a) Verify that X is a maximal prefix code and that Y is maximal suffix. 
(b) Show that the code Z defined by Z* = X* N Y* satisfies 


Z-1=(1+4A+b)(A4-1a(A-1)+4-1)(1+a+ Aa), 


[Hint: Show that Z — 1 = (X —1)P = Q(X —1) for some P C X*, Q C Y*] 
(c) Show that Z is synchronous but not separating. 
(d) Show that Z has a separating box. 


xx 


1.9 Let X C At be a set. A word x € X is said to be a pure square for X if 
(i) x = w? for some w € At, 
(ii) X N wA* N A*w = {zx}. 


(a) Let X C A* be a finite maximal prefix code and let x = w? be a pure square 
for X. Set G = Xw7!, D=w!X. Show that the polynomial 


o=(1+w)(X -1+(G -1)w(D -1)) +1 


is the characteristic polynomial of a finite maximal prefix code denoted by ôw (X). 
[Hint: Set Gi = G — w and Dı = D — w. Show that o = (1+ w)R + wt where 


R=(X —Giw-—wD)+GywD +w°Dı 


is a prefix code]. 
Show that the polynomial 


(X —1+(@—-1)w(B -1))1+w)+1 


is the characteristic polynomial of a finite maximal code denoted by yw(X) 
(b) Let X C A* be a finite maximal prefix code. Show that if x = w? is a pure 
square for X, then x? is a pure square for ôw(X) and yw(X). 
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(c) Let X C At be a finite maximal prefix code. Let x = w? be a pure square for 
X. Show that the codes Y = yy(X) and Z = dw(X) have the same degree. 
(Hint: Show that there is a bijection between Y-interpretations and Z-interpretations 
of a word.] 

(d) Let X be a finite maximal bifix code. Let 2 = w? be a pure square for X and 
Y = by(X). Show that d(X) = d(Y). 

[Hint:Show that Y — 1 = (1+ w)(A — 1)L where L is a disjoint union of d(X) 
maximal prefix code.] 

(e) Let X be a finite maximal bifix code. Let x = w? be a pure square for X and let 
Y = 6w(X). By (b) the word x? is a pure square for Y. Let Z = y,(Y). Show 
that d(Z) = d(X). 

[Hint: Set T = 6,(Y). Show that T—1=(1+w)(1+w?)(A—1)M where M is 
a disjoint union of d(X) prefix codes.| 

(f) Show that if d(X) is a prime number and d(X) > 2, the code Z of (e) does not 
admit any decomposition over a suffix or a prefix code. 

(g) Use the above construction to show that for each prime number d > 3, there 
exist finite maximal codes of degree d which are indecomposable and are neither 
prefix nor suffix. 


lsection8.2 
Section 


2.1 Show that if Y is a weak left divisor of then one may find polynomials 
P, Q1, Q2, R, satisfying the hypothesis of Theorem B.I — 


2.2 Show that if the z1,..., £n are elements of a field and if the fraction 


. 1 
x 
1 F 1 
x 
2 I 
oe ee 
Tn 
is defined, then it is equal to 
p(«1, Pn) 
p(xa, > sy) 


2.3 Show that if k < n, then 


plai, ..., an) plan—1,...,ak)— plai, ...,an—1) PO ek) 
= (ar p(aj1,..-,Ap—2) 


2.4 Show that p(1,...,1) (n times) is the n-th Fibonacci number. 


. section8.3 
Section 


.3.1 
3.1 Show that S(u) (resp. P(u), F(u, v)) defined in the proof of Lemma fr i is a sum 
of proper suffixes (resp. prefixes, factors) of words of C. 


3.2 If S € Z((A)) has constant term 0 and a € A, show that a~1(,$*) = (a71 S) S*. 
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i section8.4 
Section 


4.1 Show that if l is the number of leaves of a finite complete a-ary tree, and 7 the 
number of its internal nodes then /—1 = 7(a—1). Deduce from the literal representation 
of a complete prefixe code, the corresponding equality relating its cardinality to the 
number of its prefixes. 


. section8.6 
Section 


* 5.1 Let X be the circular code X = {a,ab,c,acb}. Show that there is no bijection 


a:X — Y of X onto a prefix code Y such that a(x) is a conjugate of x for all x € X. 


5.2 Let u(z) = oa unz” with un > 0. Let k > 1 be an integer. Show that 
1—u(z)/(1 — kz) has nonnegative coefficients if and only if u(1/k) < 1. 


i section8.7 
Section 


6.1 Let A = (Q,i,T) be a finite automaton. The aim of this exercise is to construct 
the syntactic representation of the series o = |A]. 

Let y be the representation associated with A and let M = y(A*). We may assume 
Q = {1,2,...,n} and i= 1. 

Let Æ = Q” let Eo be the subspace of E generated by the vectors Mix, m € y(A*). 
Let E be the subspace of Eo composed of all vectors l in Eo such that for all m € M 
Meer (lm)t = 0. 

Show that the linear function a: Eg > V; defined by a: y(u)ix + o- u has kernel 
Eı. Deduce from this fact a method for computing a basis of V, and the matrices of 
o(a) in this basis for a € A. 


6.2 Let S C At and o = S plow that V> has finite dimension if and only if S$ is 


recognizable (use Exercise 2 i 


6.3 Let K be a commutative field and let o € K((A)). The syntactic representation 
of o over K is defined as in the case K = Q. Recall that the characteristic of a field is 
is the greatest common divisor of all integers n such that n-1=0 in K. 

Let X be a very thin bifix code. Let K be a field of characteristic 0 or which is 
prime to the order of G(X). Show that the syntactic representation of X * over K is 
completely reducible. 


6.4 Let X be a very thin bifix code. Show that if X is synchronizing, then Y% x«(A*) 
is irreducible. 


9 Notes 


ection8s.ibis ni are froin 3 
The results i in Section Brand the proof in Section are nf seo erie Co Theo- 
rem ml 3X ends a co tative factorization result by Bchiitzenberger| ({L965b). Theo- 
rem B. I/and Corollary BREE a particular case of Paul Cohn’s weak algorithm, see Cohn] 


For their proofs, we have followed a lexicographic argument from 
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.2.4 st8.2.8 
(1993) 1993) a baporen ara and ae Hi are Ton Cog} 4985). Theorem $4 dj Lemma B.8 
ee are, fron, | Reutenauer| ({1985)). TES Ji ‘is due to Ẹ (1L961b). 
Cal 1s 


(L965) 


ue to [Hansel and Perrin t Perri ). Corollary p asi from Schitzenberges 


Note that the relations (ii) and (iii) in Lemma EEG each a weak form of the fac- 
torization conjecture, since Lı is a finite sum of words (for the conjecture, one would 
need to have Lı = 0). This form was also found by (1992). For par- 
tial results on the factorization conjecture, see foio (L977), (K981), 

[and Reutenaue (986), [De Felice] (1992), [De Felice De Felice} ({1993)). For results involving con- 
structions of factorizing codes and multiple — see (L977ah, 
(L985), (L992). 


Theorem 6.0] is from [Perrin and Schiitzenberger (992). 


The problem of characterizing commutatively prefix codes has an equivalent formula- 
tion in terms of optimality of prefix codes with respect to some cost functions, namely, 
the average length of the code for a given weight distribution on the letters. In ee con- 


text, it has been treated in several pa pers sand, in particular in Il] (L974), 

Ken t 1961). The codes of Example p 6.3 have been studied under the name of paygnet 

codes (198); [E (L982); [D (L983)). Example E is due 

to = (1983). It is a counterexample to a conjecture of Se 

(1981). A particular case of commutatively prefix codes is studied in 
[Resting (50). | 

Results of Section fare duc to (L981). The syntactic representation 
appears for the first time in (1961c). It has been developed more 
systematically in (1974) and in (L980). 

Theorem 7.2 is Maschke’s theorem. The property for an algebra of z to 
be completely reducible is equivalent to that of being semisimple (see, e.g., 
((1969))). Thus Theorem expresses that the syntactic algebra w,(A*) for o = X *, X 
a thin bifix code, is semisimple. This theorem is a generalization of Maschke’s Beste: 
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chapter1 
Chapter f 
chaettiáni.1 
Section fiii 


apteri.1 
Any word w = a bath... baf" with kı,... kr > 0 has at most one factorization 


w = ayga" y ---yp_ja"” where yu = abate with ko = io mod n, kp = jr—1 
mod n and for 1 < u < r — 1, ku = ju-1 +i, mod n. 


. section1.2 
Section 


ra.1 
Te The proof is by induction on fl + |y|. Suppose that |z| < |y|. If X is not a 
code, then z is a prefix of y. Let y = xy’. Then X’ = {z,y'} is not a code and we have, 


by induction hypothesis, x,y’ € z*. Thus x,y € 2*. 


MES This amounts to prove that for each code X, the algebra K(X) is free with basis 
X. Consider a polynomial P = *"_, a;w; = 0 for some w; € B*, such that (P) = 0, 
and set G(w;) = xi. For each zj, one gets 0 = (G(P),2;) = >) ai(xi, £j). Since X is a 
code, (x;, xj) = 1 if i = j, and 0 otherwise. Thus a; = 0 for all j. 


12.3 
ES stable submonoid satisfies this condition. Conversely, let u,v,w € M be such 
that u,v,uw,wuv E€ N. Then n = vu, m = w satisfy nm,n,mn € N and thus w € N. 
Thus N is stable. 


eas stable submonoid of a commutative monoid is right unitary: If u,uv E€ N, 
then also vu € N and thus v E€ N. 
14.5 2.10 

e proceed as in the proof of Proposition Eg Suppose that y € Y is not 
n (Y*)-1X. Then Z = y*(Y — y) is such that X C Z* c Y*, Z* # Y* and Z* is 
right unitary, a contradiction. This proves (a). Statement (b) follows directly. For 
X = {a,ab}, we have Y = {a,b} and thus Card(X) = Card(Y) although X is not a 
prefix code. 


12.6 
Teo We show by induction on n > 0 that if Y is a code such that X C Y*, then 
Sn C Y*. It is true for n = 0. Assuming the property true for n, let w E€ S97 1S,NS,S71. 
Let u,v E€ Sn be such that uw, wv € Sn. Then uw,wv € Y* by induction hypothesis 
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and thus w € Y* since Y* is stable. Hence S'S, S;,S,' C Y* and consequently 
Sn+1 C Y*. This shows that S(X) is the free hull of X. 

To prove the second statement, we introduce an intermediary statement. For any 
Z C A*, define U; and V; by Up = Vo = {1} and for i > 0 by Uj41 = U7'Z U 20. 
Visi = ZV," UV;Z—!. Let U = U;i>0U; and V = UjsoVj. Setting Q = Z*, we prove 
that 


(QNR = (Unv)* (12.9.1) 


To prove AES consider first w € UNV. It is easy to see that U C Q7'Q and 
V c QQ. Thus w € QIQ N QQ7!. This proves one inclusion. Next, consider 
w € QIQ N QQ. One may verify that QIQ c UQ, and QQ7! c QV. We have 
w = uq and wd’ € Q for some u € U and q,q' € Q. Since uqq' € Q, we have u € QQ1. 
Since u € QQ7! and QQ! C QV, we have u = q"v for some q” € Q and v € V. Since 
Q-!U CU, we have v and thus w = q'vq E€ QUNV)Q. Since Q CUNT, this 
competes the proof of ae rh 

If X is recognizable, let y : A* — M bea morphism on a finite monoid M recognizing 
X. Then each submonoid Sn is generated by a set Zņ recognized by y. Indeed, it is 
true for n = 0 since Sg = X*. Arguin induction, let us suppose that Sn = Zà 
where Zn is recognized by y. Then, by af ti. we have Sn+1 = (U N V)* where U,V 
are recognized by y. Then the free hull of X is generated by the union of all Z,,, which 
is also recognized by y. Therefore it is recognizable. 


ra.7 
een This is a direct consequence of the closure of the family of recognizable sets by 
boolean operations, product and star. 


cHepb br .8 
e conditions are obviously necessary. Conversely, let A be the set of elements 


watch cannot be written be with b,c # 1. Condition (i) shows that this set generates M. 
Indeed, if m = bc, with b,c Æ 1, then A(b), A(c) < A(m), so any m has a decomposition 
as a finite product of elements in A. Condition (ii) implies that the decomposition is 
unique. Thus M is isomorphic with A*. 


g section1.3 
Section 


ra. 1 
MET We have (u,v) € p* iff there exist £1,... , £n, Y1,- --, Ym E X such that uz1 -£n = 


Yi +*+- YmV With u prefix of y1, v suffix of £n, £1 Æ Y1, En Æ Ym- 


. section1.3bis 
Section 


fet iF M is recognizable and free, let X be the code such that M = X*. Since 
X = (M — 1) — (M —1)?, X is recognizable. Let A be a deterministic finite au- 
tomaton recogni ing, X . Then the automaton A* = (Q,1,1) is finite, trim and, by 
Proposition ieee an unambiguous automaton recognizing X*. Conversely, let 
A = (Q,1,1) be an unambiguous trim finite ufonaton. The set M recognized by A 
is recognizable submonoid. By Proposition Raat is free. 
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: section1.4 
Section 


[eet X word of D which begins with a ends with b and conversely. Thus D = Da + Dy, 
in characteristic series. If £ € Da let x = aub. Then |ulq = |uly and thus u € D*. 
Actually, u € Dž since otherwise x would have a proper prefix in D. This proves the 
equality D = aD7b+ bDya. The sets Da, Dy have by symmetry the same series. Thus, 
we obtain from the previous equality d(t) = 2pqt?/(1—d(t)/2) or d(t)? —2d(t) +4pqt? = 
0. The final formula results from the usual formula to express the roots of an equation 
of degree 2, choosing the minus sign to obtain a solution with zero as constant term. 


14.2 1.1.1 
Teo The fact that X is a code is checked like in Exercise E te m be a Bernoulli 


distribution with m(a) = p, n(b) = q. Set U = {i +j |i E€,j E j,i +j <n}. We have 
in characteristic series aY + aV = (a” — 1)/(a — 1) and ala = aV + a”aV. Thus 


T 
m(X)-1 = F a 
O gpl +a n 
= Sf t+p"-1 
1 — qpY 
_ gp tapp +p" —1—p"p qt pq 
1—qpv 
_ ge =\i(p~Y+p"-1_, 
Lo" 


which shows that X is maximal. Another approach consists in showing directly that 
X is complete. 


. section1.5 
Section E 
rā.1 
MeT to check that X is complete, we compute the minimal automaton of X* shown 
pa and d 


on Figure educe that bA*b C X*. If one withdraws an element of X, it is 


a,b 


b 
Figure 12.8: The minimal automaton of X* 


not complete anymore. For example, if a? is withdrawn, the word a* is not a factor 
of {b, ab, ba”, aba?}*, and similarly for the other words of X. Finally, X is not a code 
since (b)(aaa)(b) = (baa)(ab). 
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ea The first statement is clear. Let us suppose that X is contained in a finite 
maximal code Y. Let U = {i > 0 | a’b+ NY Æ Ø} and V = {j >0| bta NY FO}. 
Then for each integer q there is exactly one pair (i, j) E€ UxV such that q = i+j mod p. 
Indeed, for large enough n,m, k, a Y-factorization of the word b”a%t+*Pb™ gives such a 
pair and conversely. Since Y is a code containing the letter b, such a pair is unique. 
Thus p = Card(U) Card(V). Since I C U and J C V, we have Card(U), Card(V) > 2 
a contradiction with the hypothesis that p is prime. 


ies The family F is closed under arbitrary union and intersection and Ø € F. We 
may thus consider the topology for which F is the family of open sets. Let P be dense 
in the sense that for any m € M, there exist u,v € M such that umv € P. Then any 
two-sided ideal has a nonempty intersection with P. Thus P is dense in the sense of 
the topology and conversely. 


[ea The first equality is clear since y is unbordered. The second one results from 
V =U + X*, and thus Vy = Uy + X*y. For the last identity, set Z = y(Uy)*. Then 
Y= X+Z, and (X*y(Uy)*)* = (X*Z)* = 1+(X*Z)*X*Z = 14 (X4- ZZ = 1+Y*Z. 
Consequently, A* = (Uy)*(X*Z)*V = (Uy)*V + (Uy)*Y*ZV. The fact that Y is a 
code follows from the equality between characteristic series A* = R + PY*Q with 
R = (Uy)*V, P = (Uy)* and Q = y(Uy)*V. The fact that is complete also folows 
easily since P,Q, R are clearly factors of Y*. 


br8..5 
tes Tt X is recognizable, then U = A* — — A*yA* is recognizable and so is 
Y=X+y(Uy)*. 


c erä. 6 
eg Tet X bea thin code. If X is complete, then it is on Fae and there is nothing to 


prove. Otherwise we apply the construction of Proposition 6-4}to build Y = X +y(Uy)* 
starting with an unbordered word y ¢ F(X*). Then y? g F(Y) and thus Y is a thin 
maximal code containig X. 


F ction1.6 
Section A 
ré.1 
MEN Let us first suppose that e gigcomposable, i.e. that X C Y* where Y is a code 


with Y #4 A,X. By Proposition “4s bifix. We first prove that Y* is also recognized 
by y. Let us consider u € Y* and v € A* such that y(u) = (v). Let w € A* be such 
that uw € X*. Since Y is prefix, we also have w € Y*. Since w(uw) = y(vw), we have 
uw € X*. Thus u € Y*. Thus y(Y*) is a subgroup of G containing H and H is not 
maximal. 

Conversely, if H is not maximal, then H C K, where K is a subgroup with K 4 H,G. 
Let Y be the bifix code such that Y* = y~1(K). Since X c Y* and Y C F(X*), the 
code X is decomposable over Y. 


eo ir x is prefix, there is nothing to prove. Otherwise, one of the two words, say 
x is prefix of the other. Let y = xy’. Reasoning by induction, we may assume that 
Z = {x,y'} is composed of prefix and suffix codes, whence the conclusion for X since 
X =Y o Z with Y suffix. 
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encourt 
lies The code Y is formed of 11 words: 


Y = {a, aba, babaaa, babaaaba, babaab, babaabba, (ba)* , bababb, bababbba, bb, bbba}. 


An easy computation shows that if X C Z* with Z prefix, then Z = {a,b} and the 
same conclusion for Z suffix. 

To obtain Y as in Exercise VIII.1.4, choose P = {e,b}, Q = {e,a,b}, R = {e, ba} and 
w = baba. The code Z defined by Z — 1 = P(A — 1)R is separating because b is a 
separating word. 


chapter2 
Chapter f I 
p section2.1 
Section i 


21.1 
P is infinite, there is at least one letter pı which is a prefix of an infinite 


number of elements of P. Then among this set, there is an infinite number of elements 
with the same prefix of length 2, and so on. 


chdexe?21.2 . . 

ES Madea X A* NA” is the disjoint union of the sets (XN A’) A” for 1 < i < n—1. 
Thus Card(X A*N A”) < X; ] ak” < k”. The desired inequality is obtained dividing 
both sides by k”, and taking the limit for n — oo. 


. section2.2 
Section 


22.1 
MET Let p(p) = i-p. Then pis surjective since A is trim. The identity p(p-a) = p(p)-a 
is easy to verify in both cases pa € X and pa € P. In the first case both sides are equal 
to į and in the second case, they are both equal to i- pa. 


. section2.3 
Section 


chaexe? 23.1 
MET This is a direct consequence of the equality of characteristic series X — 1 = 
P(A-—1). 


23.2 
MB 2 Tet A(X) = minzex |z|. Then åA is clearly a morphism from the monoid of pr j 
bie we 


subsets into the additive monoid N. To be able to apply the result of Exercise [2-8 
have to prove first that \~!(0) = 1. Indeed, {1} is the only prefix set containing 1. Next, 
let X,Y,Z,T C A* be prefix sets such that XY = ZT. Suppose that A(X) < A(Z). 
Let x € X be of minimal length. and let U = 2~!Z. For each y € Y there are 
z € Z,t € T such that cy = zt. Then z = xu and y = ut for some u € U. Thus 
Y CUT. Conversely, let u € U and t € T. Then cut € ZT = XY hence ut € Y. Thus 
Y = UT and XU = Z. If X and XY are maximal prefix sets and if Y is prefix, then 
Y is also maximal. Thus the submonoid of maximal prefix sets is right unitary. The 
submonoid of recognizable prefix sets is also right unitary. 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


484 Solutions of Exercises 


i section2.4 
Section 


24.1 
MET Each word 7(Z) is clearly in Dž. The mapping y is onto because Da = aD*b 
(see Exercise [.4.1). The formula |y(Z)| = 2 Card(Z) — 2 is proved by induction. It is 


true for Z = 1. Then, supposing it true for X and Y, |y(Z)| = |y(X)| + ly( it ias 


2Card(X) + 2Card(Y) — 2 = Card(Z) — 2. We thus deduce from Example 
2n— 2) 


the number of maximal binary prefix sets with n elements is (1/n) (7-7). 


chapke? 24 .2 

MES The set Z = RA — R is a prefix code because R is prefix-closed. To prove 
formula Z = (XNQ)U(XNY)U(PNY), we use that X = PA— P and Y = QA- Q. 
Thus a word in RA — R is either in X NY or in X but not in Y and thus in X N Q or 
in Y but not in X and thus in PAY. If X,Y are maximal, P and Q are the sets of 
their prefixes. Then R is the set of prefixes of Z which is thus maximal. 


Ties The operations obviously preserve the family F of recognizable maximal prefix 
codes. To see that it contains all of them, consider an element Z of F. Let A be the 
minimal deterministic automaton recognizing Z Æ A. We argue by induction on the 
number of edges in A. We consider two cases. (i) there exists a nonempty word w such 
that i -w = i. In this case, let X be the set of first returns to state i, and let Y be 
the set of words which are labels of paths from 7 to a terminal state that do not pass 
through i inbetween. Then Z = X*Y. Next, X UY is in F in view of case (ii) below. 
(ii) otherwise, let Z = aX +Y for a € A such that a ¢ Z. Then X and a +Y are 
recognized by automata with strictly less edges than Z and the conclusion follows. 


. section2.5 
Section 


MET Tet us first assume (i). The code X is semaphore since A*X C XA*. If the 
property of the minimal set of semaphores S = X — A*X stated in condition (ii) does 
not hold, there exist two overlapping words s,t € S, i.e. such that s = uv,t = vw with 
nonempty u,v,w. Then sw = ut is in A*X but not in X*, a contradiction. Conversely, 
if X satisfies (ii), consider a word w € A* and x € X. Since two occurrences of words 
in S do not overlap, wz is a product of words in X. 
25.2 

e first inequality is clear since x € J”, y € J™ imply zy € J"*™. To see the 
second one, we observe that if ry € JP, there exist u,v € A* and n,m > 0 such that 
x E€ Ju, uw E€ J, y E vJ™ and p=n+m +1. Since x € J”u and J is an ideal, one 
has x € J”. Similarly for y. Then n < ||2||, m < ||yl| and thus p < ||x|| + ||y|] + 1. 


. section2.6 
Section 


26.1 
or any finite maximal prefix code X, there is an integer n be such that 


A*a” C X*a*. Since a € X, we have A*a” C X*, showing that a” is synchronizing. 
26.2 
ince X is synchronous, there are at least two states p,q € Q such that 
p-w = q-w for some word w. If |w| > n?, all the pairs (p-r,q-r) for r running through 
the |w| +1 > n? prefixes of w cannot be distinct. Thus there is a factorisation of w 
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in w = rst such that p-r = p-rsandq:-r=q-rs. Then p-rt = q-rt and thus we 
can choose a shorter w. We can therefore choose a word w1 of length < n? such that 
Card Q-n—1. Next, there is at least one word w2 of length at most n? such that there 
exist two states p,q E€ Q- wy, with p- we = q- w2. Continuing in this way, we obtain a 
word w,w2--+ of length at most n? which is synchronizing. 


mes We have to show that Z™* C U. Let w € A* be such that that uw € D. There 
is a v € A* such that uwv € X*. Since Z’ is prefix, we have wv € Z’*. Since Y* is 
right dense prefix, there is some s € Z’ such that wus € X*. This shows that w € D 
and thus that u`1D C D. Let then w € D. There is some v € A* such that wv € X*. 
Since uwv € Z™* and since Y* is right dense, there is an s € A* such that uwvs € X*. 
This shows that uw € D and it follows that D C u~'D. We have thus shown that 
w € U and thus that Z* C U. 


: section2.7 
Section 


et Let P = XA” be the set of proper prefixes of the words of X. For p € P, 
let T(p) = J pucx o(pu). For an arbitrary word w, let us define m(w) = a(x)z(p) 
with x € X* and p € P such that w = xp. Then 7 is a cylindric measure on A* and 
m(xy) = 1(x)r(y) for x,y E€ X*, as one may easily verify. 


27.2 
MA We have 


-|z| 
H(X) — A(X) = X x(a) log, TF 
rEX 


Since log(t) < t — 1 for all t > 0, we obtain 


H(X)- A(X) < (` k) -1=0 
rExX 


since J zex klel = 1. Since log(t) < t— 1 unless t = 0 we obtain that the equality 
holds iff n(x) = k~!*! for all z € X. Finally, 


H(X) -logn = > n(x) logy —— < (SD =)-1= 


st nt (x) 
. section2.7bis 
Section z 
chapte?27bis.1 
MEN Leu be the length distribution of ; A {paximal prefix code on k letters. 


Then condition (i) holds since, by Theorem [I , we have 7(X) = 1 for any positive 

Bernoulli distribution. Let w be a word which is not a factor of the words of X and let 

p = |w|. Let P be the set of proper prefixes of X. Then v is the length distribution of 

P. Since no word of P can have w as a suffix, we have Uni p < Un(k? — 1) for all n > 1. 

This proves (ii). aaia 
Conversely, let us build a maximal prefix code X as in the proof of Theorem i ean 

using the following strategy: Fix a letter a in A, and for each n > 1, choose the words 
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of X N A” among those which have a suffix in a* of maximal length. To prove that a?? 
is not a factor of a word of X, it is enough to prove that for each n > 1, one has 


2p 
Un S X Un+i + 
i=1 


Indeed, for each proper prefix q of length n there is a unique exponent m(q) such that 
ga’ is in X. This gives vp, words in X, each of which has length > n. In view of the 
inequality, one may chose an exponent m(q) between n + 1 and n + 2p for each prefix 
q. 

To prove the above inequality, we start from Vn4p = Unk? — X$: Unik? —*, which 
results from the definition of v. Using condition (ii), we obtain 


P 
WES toes. (12.9.2) 


i=1 


Hence, using Equation (RAE) with n + p replacing n, 


p p 
Si Si 
Unk? — > Un+ik”™ = Vn+p < > Untp+ik” 
i=l =i 


and finally 


p p 2p 
Un S > Unik” =F > Untptik ” < > Uni- 
1=1 i=1 i=l 


ibSynchro 
m2 Except for the case where the sequence um is ultimately equal to one, we may 
choose the words of X in such a way that for some integer n > 1 and letters a,b € A 


1. a” does not appear as a proper factor in the words of X. 
2. the prefix code Y = X N (a* Ua*ba*) satisfies 


Y = a” U {yo, Y1; Uni} 


where each y; = a'ba*‘—*~! is a word of length A; satisfying 


and there is an integer t with 0 < t < n— 1 such that A; = n iff i > t and finally 
the numbers A; are relatively prime. 


. g onetSynchro 
Then the code X is synchronous by Exercise Bait 


Finally, if the sequence un is ultimately equal to 1, we may choose X of the form 
Y U a”a*b where Y is formed of words of length at most n. Then the word a”b is 
synchronizing. 
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chapter2bis 
Chapter f I f 
g section2bis.1 
Section i 


chapleew2bési . i 
ITY. 1 e deciphering delay of a code X is infinite if and only if there is an infinite 


word that has two disjoint factorizations. This is equivalent to the existence of an 
infinite path in Gx. In the case X is finite, this is equivalent to the existence of a cycle 
accessible from some vertex in X. 


b&s2 
MIES Tet z € X*, y € XY), z e X44) and v € A* be such that vyzv € X*. Since 
z € Z* and |z|z > pa we have z E€ S(Z), where S(Z) is the set of simplifying words 
for Z, and so zv € Z*. Since y, viewed as a word on the alphabet of Y is in S(Y), and 
since zv € Z*, we have yzv € X*. This proves that yz € S(X). 


supposing that |y| > |z|, y = ry’. Then X = YoZ will Z = dey". Since Y and 
Z are two-element codes, they have finite deciphering delay by induction hypothesis. 
Thus, X also by the previous exercise. 


that X decomposes over T. The smallest submonoid M generated by a code with 
finite deciphering delay such that X* C M is the intersection of the (finitely many) 
submonoids T* containing X generated by a code T with finite deciphering delay. 

It suffices to show that if Y, Z have finite deciphering delay, then Y* M Z* is also 
generated by a code with finite deciphering delay. Indeed, let T be the code such that 
T* = Y* N Z*. Then S(Y)NS(Z) c S(T). If d is greater thant the delays of Y and of 
Z, then T? c S(Y) N S(Z), and so T has delay at most d. 

(b) Assume for instance that Y is not a subset of X(Y*)~!. There is y € Y which 
does not appear as the first factor of a factorization of a word in X as a product of 
words in Y. Set Z = (Y — y)y*. Then Z has finite deciphering delay, and moreover 
X C Z* and Z is strictly contained in Y*. 

Finally, assume that X does not have finite deciphering delay. Consider words x Æ 2’, 
y € X? and u such that ryu € x’ X*. If dis greater than the deciphering delay of Y, then 
the Y-factorizations of x and 2’ start with the same word in Y. Thus the conclusion 
follows. 

dis 
et Y = X¢. If X has delay at most d, consider 71,...,2¢a,24,--., 24, € X, 
y € X? and u € A* such that 21 ---rqyu € z---a Y*. Since X has delay at most d, 
we have successively x} = x, £2 = x, and finally zg = z4. Thus 21 ---aqg=2)--- £h, 
which shows that Y has delay at most 1. Conversely, suppose that Y has delay at most 
1. Let 2,2’ € X, ye X! andu - A* be such that zyu € 2’! X*. Then x%y is a prefix of 
a word of x?! zy“ and thus x? = r% tg’, whence z = g. 


Extendable 
tte | Let us show first the inclusion S(X) Cc E(X). Let s € S(X), p € E(X). Note 


that pt € X* for some word t and that pt still is strongly right completable. Thus, we 
may assume that p E€ E(X) 0 X*. Consider any word u € A*. Since p € E(X), the 
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word psu can be completed: there is a word v € AT such that psuv € X*. But p is in 
X* and s is simplifying. Thus, suv € X*, showing that s is strongly right completable. 

Conversely, let s € S(X), p € E(X). To show that p is simplifying, let x € X*, v € A* 
such that xpuv € X*. Since the word pvs is right completable, we have pusw € X* for 
some w € A*. But then rpvsw € X* also and since s is simplifying, we have sw € X*. 
Thus, finally, the four words x, x(pv), (pv)(sw), and sw are in X*. The set X* is 
stable, thus pv € X*. This shows that p is simplifying. 


a We first show that P is thin proving that for each p € P and a € A, the word 
pa cannot be a factor of P. Indeed, if upav € P, then up is also in P, a contradiction. 
Next, by Lemma fema Lg have § C UZI XİP, and thus S is thin. Since R C XS, we 
also have that R į thin, à Finally, let us show that S* is thin. Otherwise, since S is 
prefix by Lemma Peme ould be a maximal prefix code. Any element of R would 
then be com ppgable for the prefix order with an element of S, a contradiction with 


Lemma 


: ction2bis.2 
Section B 
ams omataBoundar 
IRT lt is clear that if A is a (d,d’)-complete automaton with bidelay (d, d’), 


then with the pairs (Up, Vp) chosen as indicated and the sets (Ue, Ve) defined by the 
compatibility conditions 2 and 4, the result satisfies conditions 1 and 3 and thus is an 
extended automaton without boundary edges. Conversely, we show that in an extended 
automaton with delay (d, d’) without boundary edges, for 0 < k < d’ + 1, the set of 
labels of paths of length < k starting at p (resp. ending at q) is the set of prefixes 
of VpA (resp. AU.) of length < k. We prove the first alternative. The other one is 
symmetrical. The statement is true for k = 0. Assume that it holds for k < d’. Let 
p& q Š be a path of length < k +1 with a € A. Then, by induction hypothesis, u is 
a prefix of V,A and thus of V}. By condition 1, au is a prefix of VpA. This proves the 
property for k + 1 in one direction (observe that we did not use the hypothesis that 
there are no boundary edges). Conversely, if au is a prefix of V,A, by the compatibility 
condition 1, there is an edge e € F(p) such that a = A(e) and u € Ve. Since is not a 
boundary edge, we have e = (p,a,q) for some state q. By condition 4, u € V}. By the 
induction hypothesis, there is a path q >, hence a path p “5. Thus the property holds 
for k + 1 and the statement is proved by induction on k. 


astial 
HIR 2 According to conditions 1 and 2, we have 


> UpVpA= 5 UNGT 


pEQ eC hy 


where E4 is the set of edges which have an origin (i.e. which are not backward boundary 
edges). Similarly, 7 peg AUpVp = iecr_ UeX(e)Ve where E_ is the set of edges which 
have an end. This proves the formula. 


examplesExtAuto 
Tt Ba I he automaton Ag is clearly a (d, d')-complete automaton with bidelay (d, d’) 


and thus an extended automaton (without boundary edges). For all u € Af and v € AT, 
there is a path p S q È r in Ao if and only if q = wv. 
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It is not difficult to verify that A_, and A, still satisfy the four conditions defining 
extended automata. In A_,, the set of forward boundary edges is Ax and the set of 
backward boundary edges is xA. Thus J ecg (e) = Av — 2A = —fr. The forward 
boudary edges of A, are the backward boundary edges of A_, and vice versa. This 
proves the last formula. 


ite. Suppose that e is a forward boundary edge from state p with label a such 
that Ue or Ve is not a singleton. We add a terminal state q to e with Ug = A` Uea and 
V, = Ve. For every word w = a1 : ++ @qdar41 E VeA, we add a forward boundary edge ew 
starting at q with label a1, and with Ue, = A` Uea, Ve,, = {a2---aa41}. And for every 
word w = a1---dq4; in A(A Uea) which is not in Uea, we add a backward boundary 
edge el, ending at q with label aq+ı and with Ue, = a1-+-da, Ver, = Ve. Iterating this 
transformation a finite number of times, we obtain an extended automaton in which 
all boundary edges are simple. 


boBoundary — -Simple 2 
ie ot By Exercise 2 we may suppose k: the extended automaton A is such that 


all boundary edges are simple. By Exercise we have J ecg le) € L. Let us write 


X Oe) = Do be fa 


eck 


where the coefficients by are integers. 

For each z € A“ such that bẹ > 0 (resp. bẹ < 0), we add to the automaton A 
the disjoint union of by copies of A_, (resp. Az). The resulting extended automaton 
A is now such that 5>0(e) = 0. Each boundary edge e of A is simple and thus 
(e) € ATI +1, Thus, for each word w € A“ we may define a bijection 7, : {e € 
E | 0(e) = w} — {e € E | 0(e) = —w}. We now identify each forward boundary edge 
of A with the backward boundary edge 7,,(e) where w = O(e). The resulting extended 
automaton has no boundary edges. 


atabagsCompletion 
BNA for each state q, define U, as the set of labels of paths of length d ending at 


q and V; as the set of labels of paths of length d’ starting at q. For each edge e from p 
to q, set Ue = Up and Ve = V}. Since A has (right) delay d’, for each state q € Q, the 
sets aV for each edge e starting at q, with a the label of e, are disjoint. Thus we may 
attach forward boundary edges to state q to complete a partition of V,A as follows. For 
each w = a1 --+aq41 E V~A which is not in any of the sets aV., we define a boundary 
edge e with origin q and label a; with Ue = Ug and V = {a2...aq41}. In a completely 
symmetric fashion, we attach backward boundary edges to each state q in order that 
the family of sets Uea is partition of the set AU. 


bis2.9 i tended 
Tie. We first add boundary edges as indicated on Figure PE on the left (for each 


boundary edge e, we indicate the pair (Ue, Ve)). We have then $ ecg le) = É ee 


bba = — fe. We thus add the automaton Ap represented on the right in Figure 
Merging the boudary LAdges by. pairs which are compatible, we obtain the automaton 


utom: 
of Figure Bd on the right. 
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Figure 12.9: The construction of an extended automaton with delay (1, 1) 


: section3.1 
Section 


there exists (p’,x’, s’) € U such that sup’ € X* and conversely. Otherwise, there would 
be more parses for uvu than for u. This implies that (L, (uv) wu) = (L, u) for all m > 0. 


. section3.2 
Section 


E The existence of k follows from the fact that w € F(X) since then, for 
each i € {1,..., n}, aj---anw E XA”. 

(b) If X is suffix, then clearly, pw is injective. Conversely, if v, uv € X, then the map 
Pw is not injective for any w € F(X) with uv as a suffix. This proves assertion (b). 
The proof of (c) is similar. 

(d) The proof results from the fact that a map of a finite set into itself is injective if 
and only if it is surjective. 


E2 a) Set X = P — PA?t. We prove that X* = P* by induction. Let x,y € A* 
be such that « € X, xy € P*. We have x = uŭ, xy = võ. If |x| < |v|, then v = xw and 
xry = uuwwuu. Thus y € P*. Otherwise, x = vw and v = wy. Then, x = yww, which 
forces y = 1. This prouves that P* is right unitary. The proof that it is left unitary is 
symmetric. 

(b) For each u € A*, uŭ and ŭu are in P. 


.2.3 
tees Thx is recognizable, then the sets G, D, Go, Do are recognizable and thus also 
Y given by Y = (X U wU Gı(wDo)* Di) — (Gw U wD). Conversely, X = (Y — w — 
Gı(xDo)* D1) U Gw U wD, and if Y is recognizable, then X is also recognizable. 


i ction3.3 
Section 


dta 3.3.1 
We use an induction on the degree of P (we call degree of P the maximal 


length of the words x such that (P, x) Æ 0.) 
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Let P = P’+ P” where P” is the sum of all terms of maximal degree in P. Let 
S = S'+ 8" be the similar decomposition for S. We have P”A = AS”. This implies 
that P” = AR, S” = RA for some polynomial R (indeed, for any a,b,c € A and w € A*, 
(P"”,aw) = (S”,wc) = (P”,bw), and thus we may define R by (R,w) = (P,aw) for 
some a € A). We obtain after simplification (P’+ R)(A—1) = (A—1)(S’+R). We can 
suppose by induction hypothesis that P’+ R = (1—A)Q+d and $’+ R= Q(1—A)+d. 
Then P = P' + AR = (1 — A)(Q — R) + d and S = S' + RA = (Q — R)(1 — A) + d. 


.3.2 
Mp2 i is clear that each set Y; is maximal prefix. They are disjoint because if 
y € Yi NY; one of piy, pjy € X is a suffix of the other. Any suffix s of X is in some Y; 
since ws E€ A*X. This shows that S is the disjoint union of the sets Y;. 


.3.8 
e existence follows from Theorem since the decomposition build by the 
proof satisfies this property. The uniqueness follows from the fact that a suffix s is in 
Y; iff it has i — 1 proper prefixes which are in S. 


: section3.4 
Section 


cha 4.1 
e may suppose that X is not m xima Since, X is am u(X) = 


max{(Lx,x) | « € X} is finite. By Theorem or each d > p(X) + d,% is the 
kernel of a maximal bifix code Z of degree d (which is unique by sien oe us 
show that Z is recognizable. For a word w, we denote by c(w) the pair (i,s) formed 
by the integer i = (Lx,w) and the word s which is the longest suffix of w which is a 
prefix of X. It can be verified that c(w) = c(w’) implies w!Z = w™!t Z. 


cha .4.2 -3.7 
y Proposition Bra the set P’ of proper prefixes of the derived code is PA H. 


When X is recognizable, so are P = XA~ and H = A~ X A`. Thus P’ is recognizable 
and so is X’ = P'A — P”. 


x| < |s], then 7z is in the kernel of X and so is in X’. Otherwise, let s = ua 
ih a € A. Then h ¢ H = A7 XA since otherwise s would not be the longest prefix 
of w which is a suffix pf X. Thus s € (HA — H) N (AH — H) which is contained in X’ 
by Proposition 

AY it e code X is clearly (by Exercise riety a thin maximal prefix code. To see 
that it is also suffix, suppose that a word of X1 N X2A7 is suffix of a word of XN X1 A7. 
Then it belongs to the kernel of X , which the same as that of X2, a contradiction. If 
X 1, X2 have also the same degree d, 


i section3.5 
Section 


cha 


5.1 
uppose |p| < |r|. Since x is chosen of maximal length, there is a pre xo of q 
such that rwq’ € X. Thus wd’ € H(X)NS and wd’ € S$’ by Proposition B-7](3). This 
implies h € H(X’). 


J. Berstel, D. Perrin and C. Reutenauer Version 22 janvier 2008 


492 Solutions of Exercises 


1 
1 
1 
1 
1 


oe W bd 
ww Ww wb 
NI OF BY] 


Table 12.3: The values of \(k, d) 


1 2 3 4 5 
2|1 1 3 73 5056783 
3/1 1 25 
4/1 1 543 
5|1 1 29281 


Table 12.4: The values of (3;,(d). 


vib. Tet x = aub € X with a,b € A. If a word w of length ¢(X') — 1 has two 
occurrences in u, then w € H(X’) by the previous exercise, which is impossible because 
the words in H(X’) have length at most @(X’) — 2. Thus each word of length /(X’) — 1 
has at most one occurrence in u, whence |u| < ¢(X’) — 1+ k!*-1! — 1 and finally 
|x| < UX’) + kX- The formula follows directly the above. Some values of A(k, d) 
are given in Table 

For d = 3, the formula gives the exact value. Actually \(k,2) = 2 and one may verify 
that A(k,3) =k+2. For k = 4, one has \(2,4) = 8 but the bound given by the fomula 
is \(2,4) < 12. 


e function y is injective because X is suffix and therefore also surjective 
(the latter is also a consequence of the fact that X is maximal suffix). 

chajter3.5.4 

LV 5.4 or each finite maximal bifix code X of degree d, AX and XA are finite 
maximal bifix codes of degree d +1. Since AX #4 XA unless X = Aĉ, we obtain 


By (d+1) > 28k(d)—1. Since G(k, 3) > Zdor k 2 2, the conclusion follows. Some values 
of 6;(d) are represented on Table i 


[Rome engh distribution 
=o oro a 


Table 12.5: The 3 finite maximal binary bifix codes of degree 3 


equal. Thus it has a factor of the form uvu where u is of length an. The claim follows 
by induction. 
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exercise, x contains an internal factor which is a quasipower of order d — 1. Since, 
by Exercise 1.1, (L, uvu) > (L,u) for any internal factor uvu with u Æ 1, we obtain 
(boo) > d which is impossible. The bound is less accurate than the one given by 


Exercise Boat 


length distribution symmetry class 


S 
© 
= 
(oz 


Ka 
E 
ro] 


C komd [Tenth distribution [symmetry dass | 
B) a piosse | > | 
0 ae ee 
D 
0 po o 


Co 
o0 


A 
N| Aje 


A E SE S S 44[ 4 | 
ab,tha,aaba,aabb [0 115 39 E S 4 | 
ab, baa, bba 


Table 12.7: The remaining 34 finite maximal binary bifix codes of degree 4 


Aje 


9 
5 
6 
2 
9 
6 
6 
3 
4 


bo 

bo 

co} co] co] œ| œ| co} cx] oo 
ajeje 


bo 
N 


ET We will describe the 73 finite maximal binary bifix codes of degree 4 according 
to their rived code. The 3 finite maximal binary bifix codes of degree 3 are given by 
Table pei Ihe table is made of 3 columns describing the code. The first one gives 
the kernel of the code, the second one its length distribution. The third column gives 
the number of codes obtained by the symmetries consisting either in the exchange of 
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the letters a,b or the reversal of words. There can be either 1,2 or 4 such symmetrical 
codes. In this way we reduce the number of codes to be listed and and we list only one 
representative of each symmetry class, the third column giving the number of elements 
of the class. For example, there is just one code with empty internal part, namely A®. 
There is one code with kernel {ab} and one with kernel {ba}. The symmetry class has 
two elements, in correspondance with the fact that ab and ba are both obtained one 
from the other by reversal or exchange of a,b. 


Bip41 
There are 39 bifix codes with derived code A® listed on Table rg We may observe 
that the length distribution can be read from the internal part as follows. The fact 
that the code X on line 5 has 4 words of length 6 corresponds to the fact that the 


internal words aab and aba overlap on ab. Thus, aaba is an internal factor of X and 
{a, b}aaba{a,b} C X. 

The remaining 34 bibrefix codes have a derivative with kernel {ab} or {ba} (there are 
17 of each kind). They are listed on Table asleiiel fact that the code X on line 23 
has 4 words of length 8 can be read as follows on its internal part. The word abbaab 
has 2 interpretations, namely (ab)(baa)b and a(bba)(ab). Thus it is an internal factor 


and {a, b}abbaab{a,b} C X. 


ab bab 
R 2 
bab aban 20" ba? 
ab 
ba? ab? b2a ab? ab 
ab 
a2b? 7 J a20? 
bab 


G4) 08) ood B 


Figure 12.10: The generation of finite maximal bifix codes of degree 4 by internal 
transformations. 


We have represented on Figure (eto the generation of the finite maximal bifix codes 
of degree 4 by inte trang ation. The labels of the nodes are the indices of the first 
column in Tables a and hi k Fach edge corresponds to an internal transformation. 
The label of the edge is the prefix used. We have only represented a part of the acyclic 


graph of internal transformations which is actually a covering tree of this graph. There 
are only three nodes without successor in the complete graph, which are 18, 20 and 23. 
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anReatslemTower . . 
e formula is direct consequence of 


X-—1=(A-1)\(4+T(A-1)), 


d P tes F 
where T = Diet Ri. Here all operations are on characteristic series. 


the square of the mean of the lengths, i.e. vx = ae n Unk” — d. Since u(z) = 
ind Unz”, we have zu (2) = = nl nunz”, whence u!(z) + zu"(z) = ips unz! 


Finally, by Problem Ba uty 1 = (kz — 1)d + (kz — 1)°t(z). Derivating twice, we 
obtain u”(1/k) = 2k?t(1/k). 


chapter7 
Chapter Wi 
: Isection7.1 
Section i 


A Let X = {ab, ba}. Let x € A* and n > 1 be such that 2” € X*. If x ¢ X*, 
then x has more then one X-interpretation. This forces x € F'(ab)*, a contradiction. 


“my ~n=z7 
let i) We argue by contradiction and suppose that x, y, z are primitive and distinct. 


First observe that |x| < |z|. Indeed, otherwise x would have two distinct z-interpretations, 
which is impossible. In the same way, |y| < |z]. 

Let us first prove that the conclusion holds if p > 3. We consider the conjugate z’ 
of z which is a Lyndon word. Then 7’ is either a factor of x” or of y”. In both cases, 
since z’ is longer than x and y, this implies that z’ is bordered. This is a contradiction 
since a Lyndon word is unbordered (Propesition FEE 

Let us finally consider the case p = 2. We may suppose that |x| > |y"|. Then we 
have x” = zu, z = uy” for some word u. Thus 2” = uy”u. But this implies that, 
changing z by some conjugate x’, the equality 2’” = u?y”. By induction, we have 
x',u,y € t* whence the contradiction. 


lchdgente7 71.2 
We suppose |x| > |y|. We may also suppose that x and y are primitive (since 


otherwise y*x U x*y contains an imprimitive word). If X* is not pure, there exists 
u Z X* such that u” € X. Let w = u”. We may suppose that w ¢ x* U y* since 
otherwise x or y is not primitive. 

We consider the least integer i > 1 such that w? € X*ay'xX*. Replacing w be an 
X-conjugate, we may suppose that yx is a prefix of w and x a suffix of w. Let w’ be 
an X-conjugate of w such that wh = hw’ and with h shorter than the word z € X such 
that w’ € X*z. We distinguish several cases. 

1. w € yX*x. By definition of the integer i, one has w’ € y?X*x. Let k,k’ be 
such that xh = kx and y'k’! = hy’. Since k and k’ are prefixes of x of the same 
length, k = k’. Thus ytzh = ytkx = y’k'x = hy'x which shows that y‘x is not 
primitive. 

2. w € aX*x. Suppose first that |hx| > yt. We have in fact w’ € zy? X* N X*y?a 
since otherwise z would be a nontrivial factor of x?, a contradiction with the 
hypothesis that x is primitive. Since y’z is a suffix of w’, there exists k such that 
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CAJ CK Ck]J CAJ 
CILY] 
————— ow ë 

Figure 12.11: Case 1: w’ € yX*x 

[YTY Tt | 

CAJ BAA LE] Lh] 
[| x TUT | [YTY] T | 
EE e) 


Figure 12.12: Case 2: w’ € xX*x and |ha| > y’ 


yx = krh. Since yz is a prefix of w, there exists k’ such that y’x = hak’. Since 
|k| = |k’|, and both are prefixes of yt, we have k = k’. Thus yz? = hakax = krhx 
is imprimitive. Since x,y are not powers of a common root, we have i = 1 and 
yx? is imprimitive. 

If |kæ| < |y*|, then i > 1. We have w € X*y?°x since otherwise g is a nontrivial 
factor of z?. And w € X*gz? since otherwise y is a nontrivial factor of y?. Thus, 


B a | 
PA m PA w A Al [Al 
Cz a 7] CJ [Y] 7] 


a a ee R) 


Figure 12.13: Case 2: w’ € xX*x and |ha| < |y’| 


there is a prefix k of yt such that ytz = kx7h. 

If w € zy? X*, then there is a prefix l of y’ such that ytz = hex. Since |k| = |4], 
we have k = £. Thus ytz? = hkz? = kz?hz is not primitive, which is impossible 
since 7 > 1. 

Thus w’ € £? X*. If |hx?| < yt, then z is a factor of y* with two y-interpretations, 
a contradiction with the fact that x is primitive. Thus |hx?| > y’. Since z 
has only one y-interpretation, we have h = k. Thus ytz? = (ha?)?, which is 
impossible since i > 1. 


3. w € X*y. Suppose first that |ha| > |y’|. Then there is a suffix k of x such that 


CA] FAA ÆTLA] 
L © [TYTY | CI Ly] 
E 


Figure 12.14: Case 3: w’ € X*y and |hkz| > |y"| 


y? = kh and a suffix k’ of x such that ytz = hæk’. Since |k| = |k’|, we have 


Version 22 janvier 2008 J. Berstel, D. Perrin and C. Reutenauer 


Solutions of Exercises 497 


k = k'. Thus yz = hzk = khz is not primitive. 
Suppose now that |ha| < |y’|. Then i > 1 and there is a prefix k of yt such that 
yt = kzh. If w € yt X*, then there is a prefix £ of yt such that yt = hzl. Since 


hl Z A m A A A FA 
C2 y7] Ly Ty | CYL] 


Figure 12.15: Case 3: w € X*y and |ha| < |y"| 


\¢| = |k|, we have k = £. Thus ytz = kaha = hzkz is imprimitive. 

Finally, suppose that w € x? X*. If |hx?| < |y"|, then x has two y-interpretations, 
which is impossible since x and y are primitive. Thus |hx| > |y*|. We cannot 
have w’ € 2°X* since otherwise x has two z-interpretations. Thus w’ € 2x2y’. 
Let £ be the prefix of yê such that ytz = hx. Since |k| = ||, we have k = £. 
Thus ytz? = hx*kx = kxhx? is imprimitive, which is impossible since i > 1. 


Mita Suppose that X* is not pure. Then x*y U y*x contains a word which is not 
primitive. Suppose that z”y = z™ for some n > 1 and m > 2. If (n — 1)|z| > |z| 
then z™ and x” have a common prefix of length n|x| > |z| + |z|. Thus x and z are 
powers of a common word by Fine-Wilf theorem, a contradiction. Otherwise, we have 
(n — 1)|z| < |z—. Since |z| = |y| we have (n + 1)|z| = m — z|. Thus (n-—1)m<n+1 
or equivalently (n — 1)(m — 1) < 2. The only case remaining to check is n = m = 2. 
Suppose that |x| + |u| > |z|. Then u = rs with z = uvr = svvu. It follows that 
|r| = |v| + |s|. Thus sur = vrs which implies that s,r,v are powers of the same word, 
a contradiction. The case |x| + |u| < |z| is similar. 


. Isection7.2 
Section 


n > 1 be such that 2” € X*. Bet e be the idempotent in ee Then oe je = ep(x) = 
e and thus x € X*. Thus X* is pure. Conversely, let e € M be an idempotent and let 
G be its H-class. Let w € y~!(G). We may suppose that w ¢ F(X). There is an n > 1 
such that p(w”) = e. Let p be a fixpoint of e. Since X is finite, there is a factorization 
w” = uv such that 


p>1i>p 


and thus such that vu € X*. We have vu = (rs)” with r,s such that w = rs. Since 
X* is pure we have rs € X*. Thus p is also a fixpoint of w. This shows that the group 
containing e is trivial. 


= (Q,1,1) be a trim unambiguous automaton 
recognizing X* and let p = yy. Let J be the 0-minimal ideal of M = y(A*). If X 
is circular, it is uniformly synchronous. Thus the set T = y~!(M — J) N X* is finite. 
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The minimal generating set W of the ideal y~!(,J) is also finite and so are the minimal 
generating set U of the right ideal (J N yp(X*))M and of the left ideal M(J N y(X*)). 
They satisfy moreover 


X* = TU (UA* AAV) — AW A*. 


Conversely, if X* satisfies this property, it is clearly uniformly synchronous and thus 
circular. 


Wes G) Let us suppose for instance that X is (1,1)-constrained. Suppose that 
ugu, uuz E X*. We may assume upg, u1, u2 Æ 1. If they belong to X, then u; € X*. 
Otherwise, if for example upu, Z X, then uo = zu, uz = vy with x,y € X* and w € X. 
Then v € X* and thus u € X*. 

(b) X is (3,0) constrained since ugu, u1u2, u2u3 € X imply uo = ug = 1 or wy = 
u3 = 1. It is not (3,0) limited since it is not prefix. 


Meaty and (ii) are clearly equivalent. To prove that (ii) implies (iii), we first have 
that X is a semaphore code since (ii) implies A*X C XA*. Let S = X — A*X. If 
uv, vw E€ S, then wv, uvw E€ A*X imply w € X*. This forces v = 1, thus proving (iii). 
Conversely (iii) implies clearly (ii). 


MES Tet X be a recognizable circular code. Let y : A* — M be the morphism on the 
syntactic monoid of X*. We show that X is (p,p) limited with p = Card(M) + 1. Let 
indeed uo, U1,...,U2p E A* with uj_iu; € X* for 1 < i < p +q. We first observe that 
for any 2,7 such that 0 < i < j < 2p, if u;,uj E€ X*, then up € X* fori < k < j since X 
is a code. Now, since y(ug),..-, (up) cannot be all distinct, there is a indices j, k with 
0<j<k<psuch that (uj) = (ux). Then, since X is circular uj, uj41,..., Uk E X*. 
In the same way, there exist two indices ?,m with p+1 < £ < m < 2p such that 
plue) = Y(Um) and thus ug, uez1,.-.,Um E X*. This implies up E€ X*, proving the 
claim. 


. section7.3 
Section 


F Suppose that for £z1,..., £k E Ln and y1... Yk E Ln we have z1: -£k = 
sy2°--ypp and yı = ps with ps # 1. Then x < yo < z2 < ... < Tk < Yı < T1, a 
contradiction. Thus L,, is circular. 

The set Lə is comma-free only if k < 3 since for k = 4, (ab)(cd) = a(bc)d with 
ab, be, cd € Lə. The sets L3, L4 are not comma-free for k > 3 since (aab)(bbc) = a(abb)bc 
and (aaab)(bbbc) = a(aabb)bc. 


demt A 5 

We have by Proposition EE with P = XA, Ps = XR whence upt = uxup. 
Since P(A — 1) = X — 1, we have up(kt — 1) = ux — 1. The formula for ux follows. 
The formula also follows easily from t? + ktux = ux + uyt. 


3.3 | . leNewton 
. is is a direct consequence of Formula ( ; 
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73.3bis 
be a circular code on a suitable alphabet B such that u, = Card(X N B”) 


(the alphabet may be infinite). One may define a one-to-one correspondance a : A > X 
between A and X such that the weight w(a) of a is the length of a(a). Then the result 
follows from the fact that for any z € A* 

e z is primitive if and only if a(z) is primitive, 

e w(z) = |a(z)l, 


e y € A* is conjugate to z if and only if a(y) is conjugate to a(z). 


73. 3ter 
Mea Leta and B be two weighted alphabets such that A (resp. B) has un (resp. 
Un) letters of weight n for each n > 1. Since un < vp, we may suppose that A C B. 


Then the set of primitive necklaces of weight n on A is a subset of those on B. 


73.4 
H ne has 


n>1 n>1 din 
>- 7 (vaz7)° 
d,e>1 
= X log(1 — v-d) 1 
d>1 


whence the formula by taking the exponential of both sides. 


73.5 le3 
EA The right-hand side of Equation (63d) may be rewritten as [](1 — 2!”!)~1 


where the product is over all primitive necklaces v meeting X*, in some fixed decreasing 
ordering of these necklaces. This in turn is equal to [] >) 2"!”!, which is the sum 
of all monomials 2”!1!... 2”*l¥«l| for all integers k, n1,...,n% and necklaces as above 
with vı >--- > vg. For the second solution, one uses the fact that a free monoid has 
the complete factorization of Lyndon words, that these are in bijection with primitive 
necklaces, and that primitive necklaces within X* coincide with primitive necklaces of 


A* meeting X*, since X* is a very pure submonoid. 


e last factorization is proved by induction on n, together with the fact that 
eh C; is contained in A’ and that X„+1 has only words of length at least n+ 1. The 
case n = 0 is clear. If it is true for n, then define C41, Xn+2 as indicated and verify 
the previous properties, using the bisection H* = K*((H — K)K*)* where K C H. 
The finite factorization above leads to the infinite factorization X* = CTC} ---Ch---. 
To deduce the nonnegativity of the integers vn, apply the homomorphism sending each 
letter in A onto z. 


H is rational, then X* too, and it is easy to show that the conjugation closure 
of a rational language is rational, by using the syntactic monoid of the language. Since 
X* is very pure, its conjugatio closure is a cyclic language. Now, the generating 
function of X* is by Equation E ual to the zeta function of its closure under 
conjugation. 
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To show that the zeta function of ac clic Janguage L has the indicated & ansion, 
ar (ina 


proceed as in the proof of Proposition rst, one has Equation hen one 


shows by taking the logarithmic fenaa that the equality of i: zeta action with 


the right-hand side of Equation (| 1s equivalent to Equation ( 


chapter4 
Chapter Vv i 
: Isection4.2 
Section i 


Thus such a path is unique. 


i section4.3 
Section 


VIRT For any (u,v), (u’v’) € D such that (u, Ho '), one has also (u, v), (u’,v) € 


v 
D and (u, v)olu, v’ ya ,v). Indeed, since (u, ip (u’,v’) there are n,n'm, m’ € N such 
that 
nu=nu, um=v'm’. 

Multiplying the first equality by v’ on the right and the second one on the left by u’, 
we obtain nuv’ = n'u'v'! € N and uv'm’ = uvm €E N. Since N is stable, this implies 
uv’ € N. Thus (u,v’) € D and (u,v)p(u,v’). A similar proof holds for (u’, v). 

Since (1,n)p(1,n)p(1, 1) for any n € N, N x N is the class of (1,1). 

All we have to verify is that y is well-defined, in the sense that (U,V)p(U’, V”) iff 
there are u € U, v’ € V’ such that um € U’ and mv’ € V. Let us consider r € U 
and s € V’. Then (u,mv’)p*(r, mv’) and thus rmv’ € N. Moreover, since (u, mv’) = 


(uo, Vo) P(U1, vi)p... pluk, vk) = (r, mv’), we obtain (um, v’) = (uom, v’)p(urm, v’)p... plugm, v’) = 


(rm, v’). Thus rm € U’. The proof that ms € V is similar. Thus y(m) is well-defined. 

If M = A* and N = X*, the classes of p* are the sets X*u x vX* for u,v Æ 1 such 
that uv € X. Thus the classes are in bijection with the states of the flower automaton. 
The action also coincides (by Proposition 2.2). 


cha .3.2 
e condition is obviously sufficient. Conversely, let c be a n x p matrix such 


ee its columns form a basis of the E of m. Then m = cl in a unique way. The 
matrix n = lc is invertible and satisfies n? = n?. Thus n is the identity. 


(H*m)Smwg = lamcylycK 
= lagma)yclayicK 


leayma)payicK 


= lymcK 
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The last equality comes from the fact that the right multiplication by a‘,,az is the 
identity on H’ and eaym € H’. The proof that Syg'(m * H) reduces to the same 
expression is similar. 

Consider the map p from D to A x Ge xT associating to m € D the triple p(m) = 
(K,g,H) defined by m € KM N MH and g = rb'gma'yl. It is one-to-one because 
m = lggry. It is a morphism since form € KMNMHAND and m’ € K'MAMH'AD, 
we have 


p(m)p(m’) = (K,rbgmayl, H)(K', rbgim ayl, H”) (12.9.3) 
= (K,rbgmayrapybglrb gim ayl, H’) (12.9.4) 
= (K, rb'gmm ayl, H’) = p(mm’). (12.9.5) 


via (a) Choose 


0 -1100000 00100 0 0 0 
(eee eee ue eal 
1 0 000000 10010 0 0 0 
„lo 1000005 een 0 d 
0 0 001111? 00 0 0 1 -1 -1 <i 
Eri tS ooo a l 
GRETE a l 
0 0 000001 00000 0 0 1 


(b) Set M = p4(A*). For q € Q, let ug E A* be such that q “4 q and that eq = PA luq) 

is an idempotent of minimal rank of M. Since p is a reduction, there exist p,p’ € 
p—'(q) such that p 3 p'. Since eq is idempotent, there is a fixpoint sq of eg such that 

p 1 Sq i p'. By unambiguity, we have p(s) = q. Let eg = lọrq be the column-row 
decomposition of eg. Define A(p) = q if p(p) = q and (sq, p) € rq. Next, define u(p) = q 

if o(p) = q and there is a fixpoint s of eg such that (p,s) € l4. Let q S q' be a path i Pace 
B and let m = y4(w). Then (q,q') € egme,y and thus egmey #4 0. By Proposition p 
the relation rọmlg is a bijection from the set of fixpoints of e, on the set of fixpoints 

of eg. This shows that the pair (A, 2) is an unambiguous realization of p. 


i section4.4 
Section 


an xr matrix whose columns form a basis of the columns of m. Conversely, if m = cl 
with c € K”"*” and l € K"*” then the columns of c generate the columns of m. 


aa 100 1 
1 1 0 

m= 0 1 0 -1 
n 0 0 1 1 
0 0 1 
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and thus the rank over Z is 3. It can be verified that there is no such decomposition 
with nonnegative coefficients. 


We treat the case where M does not have a zero. Since RN LON is a 
subgroup, it contains the idempotent e of RN L. In the same way the idempotent e’ of 
R'OL’' isin N. Thus ee’ isin NO ROL’. 


(ii). Indeed, let m be of minimal rank and let v = mp, be a row of 
m. For any n € M, since the right ideal mM is minimal, there is an m’ € M such that 
mnm = v. Since vnm’ = v, we have vn Æ 0. 

(ii) implies (iii). Suppose that v is not maximal and let v’ > v be a row of an element 
of M. Let q E€ Q be such that (v’ — v), = 1. Let m € M be such that w = mp, is a 
maximal row. Let n E€ M be such that ng, = 1. Then v'nm is a row of an element of 
M which is > w and thus equal to w. This forces vnm = 0 and thus 0 € vM. 

(iii) implies (iv). Let v = mp, be a maximal row. Let m’ € M have a minimal number 
of distinct nonzero rows. Let q,s E€ Q be such that mj, = 1. Let n € M be such that 
Nsp = 1. Then m’n has a minimal number of distinct nonzero rows and (m/n)qp = 1. 
Thus v is the row of index q of m’nm which has a minimal number of distinct nonzero 
rows. 

(iv) implies (ii). Let v = mp, where m has a minimal number of distinct nonzero rows. 
If vn = 0, then mn has less dinstinct nonzero rows than n. (iii) implies (i). Let v = mp. 
be a maximal row. Let n be of minimal rank with ng, = 1. Then n,, > v and thus 
Nq» =v. This shows that v is a row of an element of minimal rank. 

Observe that a matix of minimal rank r has r distinct nonzero rows and thus a 
matrix has a minimal number of distinct nonzero rows if and only if it has minimal 
rank. Indeed, let e be an idempotent of minimal rank d. Let m = Ir be the column 
row decomposition of e. Then the rows of e are sums of the rows of l. But since the 
rows of l are in particular rows of e, they are maximal. Thus all rows of e are rows of 1. 


e statement is a simple consequence of the fact that a word u is right 


ea eab if an ly if RAKA) 1x Æ 0. 
(b) By Exercise e vector y(w)i. is maximal and 0 ¢ y(w)ıxM. Thus w is 


strongly right toaplaiabie by (a). Let x € X* and u € A* be such that zwu € X*. 
Then y(zw)ix > p(w)ix implies y(xw)ixp(w)ix. Thus y(rwu)ix = p(wu)ix, showing 
that wu € X*. 
nat 

e first statement is clear. To see the converse, first observe that L and C 
contain singletons and thus, for any q € Q there is l € L (resp. c € C) such that lq = 1 
(resp. cg = 1). Next, for any £ € L and m € M, we have m € L. Similarly, for any 
m € M and c € C, we have mc € C. Let now m,n € M. For any l € L and c € C, we 
have Im € L by the previous remark and thus Imnc = (lm)nc < 1. Hence mn € M, 
which shows that M is a monoid. For any p,q € Q, let 1 € L and c € C be such that 
lp =q= 1. Then 1 > Imne > (mn)pq. This shows that M is unambiguous. Any product 
cl for c€ C and £ € L is in M since for any c’, U € C x L, Ced = (lo) (l£d) < 1. Thus 
M is additionally transitive. This proves (i). 
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To prove (ii), consider a transitive monoid of unambiguous relations on the set Q. 
Let L (resp. C) be the set of rows (resp. columns) of the elements of M. Let L’ be 
the set of all row vectors £ in {0,1}@ such that £c < 1 for all c € C. Then L'/M = P. 
Indeed, for any £ € L’, m € M and c € C, we have mc = &(mc) < 1 because MC = C. 
Thus ém € L’. Next, let C” be the set of column vectors | in, {0, 1}@ such that £c < 1 
for all 2 € L’. Then L’ and C” satisfy the condition GF et N be the transitive 
monoid of unambiguous relations formed of all n such that nc < 1 for all £ € L’ and 
c € OC’. For any L€ L', m € M and c € C’, we have mc = (€m)c < 1 since 4m € L. 
Thus M is a submonoid of N. 


cha nat2 
et e be an idempotent of M of minimal rank with column-row decomposition 


e = cl such that u is the sum of the rows of l and v is the first column of c. Then Ime 


is a permutation and thus umv = 1. 
rnat 


The rest of the proof is the same as that of Exercise 


(ii) The first equality comes from the two ways to express the set of pairs (q, w) for 
q E Q and w € U. The second one is analogous. The first equality on the second line 
corresponds to the one-to-one correspondance between an element w € U and the set 
of pairs (q,c) E€ Q x V such that wN c= q. 

(iii) For each pair (w,c) € U x V there is a unique pair (p,q) in Q x Q such that 
Wp = Mpq = Cq = 1. We conclude that 


pq n 
hk rs 


t 


such that ¿mce < 1 for all € L and c € C. Let U (resp. V) be the set of maximal 
elements of L (resp. C). We consider the set R obtained by adding to Q a set pu of 
elements in one-to-one correspondance with U. We form the set U’ of subsets of R 
obtained by adding to each u € U the element pu. We also denote by U’ the set of 
characteristic vectors of the sets u € U’. Let V’ be the subset of {v € {0,1}* | wv < 
1 for all u € U’} which are maximal. One has actually uv = 1 for all v € V” and u € Ọ’ 
since v contains either an element of u or the element pu. 

Let us show that for any m € {0,1}"** such that wmv < 1 for all u € U’ and v € V’ 
and which is maximal for this property, one has actually umv = 1 for all u € U’ and 
v € V’. Suppose indeed that umv = 0. For any q € v , there is a pair (r,s) € m and 
a pair (u’,v’) € U x V’ such that r € U and q,s € V’. When q runs through v, the 
set of states s forms a set u’ which is such that u’v < 1 for all v € V’. Suppose that 
u’ and v have a common element k. Then, chosing q = k, we obtain that u'v' > 2, a 
contradiction. Thus u’v = 0, which is also a contradiction. This proves the claim. 


lchardxeqAC 1 
VI.B.11] Suppose that m,, = 1 for some p,q € Q. Since M is transitive and does 
not contain zero, there exists a maximal row r such that r, = 1. Let us assume that 
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T = fisy i some 1,6 M. Then nm < nm’ and (nm). = Nsm is a maximal row by 


Exercise Us (717) sx = (nm’)s,. This forces Mpg = 1 since m < m. 


Let p E€ Q and u € A*, be such that y(w)>. is not a maximal row. Since A is 
complete and strongly connected, there exists a maximal row r such that rp = 1. There 
is at least a state p’ distinct of p such that ry = 1 and y(u),. 4 0 since otherwise 
ry(u) is not maximal. Hence there is a state q E€ Q and a word v of length at most 
n(n —1)/2 such that q > p and q > p’. Then y(u)px < y(vu)gx. This proves the claim. 

By the claim and its symmetric, there exist pairs (p1, u1), (p2,u2), .--,(Ps,Us) in 
Q x A* and (v1, qı), (v2,q2), .-- , (Vt, qt) in A* x Q such that, with x; = (u;i -+ u1)p;x 
and yj = g(v1 ++" jeg 

(i) u =v, = 1 and pı = qı- 

(ii) for 2 < i < s, the word u; has length at most n(n — 1)/2 and x; > zi—1. 

(iii) for 2 < j < t, the word v; has length at most n(n — 1)/2 and y; > yj-1. 

(iv) £s is a maximal row and yz is a maximal column. 
Let u = us...u, and v = v1... v We have |u| < (s — 1)n(n — 1)/2 and |v| < 
(t —1)n(n —1)/2. Thus |uv| < (s +t— 2)n(n — 1)/2. Since A is unambiguous, we have 
Tsy = 1. 

Thus s +t < )ijeg(@s)q + Migcg(¥tla < n+1. Let finally z € A* be such that 
qi — ps with |z| < n— 1. Then w = vzu is such that yxs < (w). By Exercise Bart 
this implies y(w) = yt£s, whence the conclusion. 


: section4.5 
Section 


We treat the case where the code is complete. Let A = (Q,1,1) be an 
ambiguous trim automaton recognizing X*. Let K be the set of minimal rank of 
M' = vy(A*). There exists a morphism 7 from M’ onto M such that y = yya. Then 
J = ~(K) is the minimal ideal of M and the other properties follow from the fact that 
they hold for K. 


5.2 

VEA huon) = u(n), then for any H € A, we have H-m = H-n and H*xm = Hn. 
; ER . = = 5 1 — 

== . $ — = 5 1 = 
Let H H -m. Since H *m =Iymcq and H * n = lyney, we obtain lymapc 
lyna',,c. Multiplying on the right by lay we have lyma',,clay = lyma',,clay whence 
lym = lyn since xa‘,,ay = x for all x € H. This proves the equivalence concerning 
pt. The other one is proved in the same way. To prove is that the function m > 
(u(m),v(m)) is injective. 


ViA- Let X be a prefix code and let e be an idempotent of J. Suppose that 
Men y(X*) # @. Let f € Me be an idempotent in y(X*). Then fe = f implies 
e € p(X*) since p(X") is right unitary. 

Conversely, let u,v € M be such that u, uv € Y(X). We may assume, multiplying 
u on the left by an element of J N y(X*) that u € J. For any n > 0, we have 
(uv)” ti € X* and thus (vu)” 4 0. Let e be the idempotent in (vu)+. Since the left 
ideal Mu is minimal and since e € Mu, we have u € Me. Thus Men y(X*) 4 0, which 
implies e € y(X*). Since X is a code u, uv,e € y(X*) imply v € X* by stability. Thus 
X is prefix. 
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: section4.6 
Section 


We treat the case where the code is complete. Let A = (Q,1,1) be an 
ambiguous trim automaton recognizing X*. Let K be the set of minimal rank of 
M' = vy(A*). There exists a morphism 7 from M’ onto M such that y = yya. Then 
J = w(K) is the minimal ideal of M. Let G” be an H-class in K such that ¢)(G’) = G. 
Let H’ = G' N va(X*). The restricion of p to G” is one-to-one and w~(H’) = H. This 
proves the claim since G(X) is the represented as a permutation group as G” acting on 
the right cosets of H”. 


et u E€ A* be a word which is not a factor of X. Then, for each integer i > 1, 
there is a prefix p; of u and a suffix s; of u such that szt, z'p; € X*. Since there is a 
finite number of pairs (s;, pi), there exist integers 1 < j such that p; = pj and s; = sj. 
Then s;zttip; = (siz) (zf pj) = (s;27)(z*p;) imply z~t € X*. 


completabies in iZ * (and thus in X*) ad a word y € Z* strongly left completable in Z* 
(and thus in Y*), which proves that the condition is satisfied. 

Conversely, the existence of y € Y* strongly right completable in X* shows that X 
is complete. Thus, there exists z’ € A* strongly left completable in X*. Similarly, 
there exists y’ € A* strongly right completable in Y*. Let u = x'x and v = yy’. Then 
u is strongly left completable in both X* and Y* and v is strongly right completable 
in both X* and Y*. Thus, for any w € A*, the word vwu is both strongly right and 
left completable in X* and Y*. It follows from Exercise at some power of uwv 
is in Z*. Thus Z is complete. It is moreover thin since Z* is recognized by the direct 
product of automata A and B recognizing X* and Y* (which has finite minimal rank 
as A and B). It is thus a maximal code. 


containing Z (Exercises 1.5.5 and 1.5.6). Let u, v be two distinct words in T which are 
not in Z (the method used to build T adds an infinite number of words). Let 


X = Z+u4+((T-Z)-u)(T—u)*u 
Y = Z4+v4+((T-Z)-v)\(T—v)*v 


Then X and Y are obtained by composition as maximal rational (resp. thin) codes. 
Clearly Z* C X*M Y*. To show the converse, let w = t1 ---tn E X*NY* with ti ET. 
Suppose that w ¢ Z*. Then u and v appear among the t; and the uniqueness of the 
factorization forces u = v, a contradiction. 


. section4.7 
Section 
UA 
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chapter4bis 
Chapter V T) 
i section4bis.1 
Section B 


W. Since x ¢ W, there exist u,v € A* such that uzv € X*. Let x = ry = zt with 
r,t € A* and y,z € X®. Then uzty € a implies ztv € X*. And ztv = ryv € X* 


implies x = ry € xX This proves 3 

To prove i onsider a word v € V. Suppose that v g W. Let n be the least 
integer such that v is a factor of X”. Then wow = £1£2-:- £n for some u, w € A* and 
xi € X. By the definition of V we have n > s + 2 and by the minimality of n, u is a 
prefix of xı and v is a suffix of zn. Thus z2---£n—1 is a factor of v, a contradiction 
with the fact that v does not have a factor in X**!. 

To prove the opposite inclusion, let w be a word in W without any proper factor in 
W. We have to prove that w does not have a factor in X5+1, If w € A, the conclusion 
holds. Otherwise, let w = ahb with a,b € A and h € A*. Let us first suppose 
that h has a factor in X°. Since ah,hb ¢ W, there exist u1, u2,U3, u4 € A* such 
that ujahu2, uzhbu4 E€ X*. Since h has a factor in X*, we obtain by synchronization 
uahbu4 € X* a contradiction. Suppose now that w has a factor in X°*+!. Since h does 
not have a factor in X*, the only possiblity is x € X*+, a contradiction. 

hss .1.2 0 
ssume first that X*A* N A* X5 C X*. Then by Proposition EF every pair 
of words in X* is synchronizing. Completion follows from the inclusion X*wX* C X* 
for all w, and from the fact that X is nonempty. 

Conversely, let X be a complete code with synchronization delay s. Again by Propo- 
sition every pair (x,y) of words in X°* is such that yA* N A*x C X*. 


s Y” = XU(T\W). We show first that Y’ C Y. Let y € Y’ and suppose 
that y ¢ Y. Then, since Y’ C M, one has y = y1 ::: Yn, with y; € Y and n > 2. 

At least one of the y; is not in X. Take y; ¢ X with i minimum. Then y1,...,Yi-1 € 
X, and y; € XS A* by definition of M. Hence y € X*~!*%A*, which is possible only if 
i = 1 in view of the definition of T. Thus yı ¢ X and similarly yn ¢ X. 

Now yı € A*X*. Choose i € {2,...,n} minimum with y; ¢ X. Then y; is in X*A*, 
hence yı --- y; € A*X?S A* and so is also y, contradiction. Thus y € Y. This proves the 
inclusion. 

Conversely, let y € Y. If y € X*, then y € X and hence y € Y’. Suppose now that 
y ¢ X*. Then y E€ X° A* N A*X®, since y € M. 

If we assume that y € X°+1A*, then y = xzr with x € X, z € X*. We cannot have 
zr € A*X®*, otherwise zr € M and y is decomposable in M, contradiction. But y = r’2z’ 
with z’ € X*. It follows that zr is a proper suffix of z’. Since z’ is a synchronizing 
word, we obtain zr € X* and thus y € X*, a contradiction. 

Symmetrically, y ¢ A*X2°*1. Thus y € T. Suppose y € A*X2°A*. Then y = rzz'r', 
with z,z’ € X%. Since y is indecomposable in M, either rz or z’r’ is not in M. We 
may suppose that rz ¢ M. Then y = 2s with z” € X® and rz is a proper prefix of z”. 
Since z is synchronizing, we obtain rz € X*, a contradiction. Thus y ¢ W, showing 
the inclusion Y C Y”. 
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: section4bis.2 
Section 


Wiha Te is clear that a strictly locally testable set is locally testable and that the 
family of locally testable sets is a boolean algebra. Thus a finite boolean combination of 
strictly locally testable sets is locally testable. Conversely, a locally testable language 
is a union of classes of ~, and such a class is a boolean combination of sets of the 
form yA*, A*y and A*yA*, which are either strictly locally testable or complements of 
stricly locally testable sets. 


és 2.2 
Wie bey be a strictly locally testable set. Let y : At — S be the morphism 
from At onto the syntactic semigroup of Y. Let e be an idempotent of S and let 


w € y ‘(e). We may assume that w is longer than any word of the sets T, U, V, W 
defining Y by ( . Then it is easy to verify that w is a constant. 


hss 2.3 
Wie Lets be the order of Y. Let y : At — S b the morphism from At on the 


syntactic semigroup of Y. Let e be an idempotent of S and let w be a word of y~} (e) 
of length larger than s. Then for any words p,u,v,q, we have pwuwuwq ~s, pwuwq 
and pwuwvwq ~, pwuwuwg. Thus eSe is idempotent and commutative. 


acterization of strictly locally testable sets given by Exercise 2-2, Let y: At — S be 
the morphism on the syntactic semigroup of the locally testable set X*. Let e be an 
idempotent of S. Suppose that p,q,r,s E€ S are such that peq,res € y(X*). Since X* 
is locally testable, the semigroup eSe is idempotent and commutative. Thus 


-slt 
pg we need to prove only one vo aes use the char- 


peqres = peqrespeqres = pespeqreqres = pespeqres = peqrespes 


is an element of y(X*). Since y(X*) is stable, this implies pes € y(X*). Thus e is a 
constant. 


hss .2.5 
Wits Leta = (P, E) and B = (Q, F) be elementary equivalent automata on the 
alphabet A. Let R,S be matrices such that RS is the transition matrix of A and SR 


is the transition matrix of B. Let L(A) and L(B) be the sets of labels of paths in A 
and B. 

Let w € L(A). Since A is strongly connected, there are a,b € A such that awb € 
L(A). Let awb = a; ... an with a; E€ A. Then there is a path pp S pi Z... pn-1 B pn. 
For each i = 1,...,n, we have a; = u,v; with u; € Rp, , 9, and vi E Sq,»,, and thus a 
path 


UL vi U2 v2 Un Un 
Po `> qı > Pi > Q2 > «++ Pn-1 > In `> Pn 


using alternatingly entries of R and S. Then either uy = aj,...,Un = Gn and vy = 
... = Un =loruy =... = Un = 1 and vj = a4,...,Un = Gn. We consider the first case, 
the other one being symmetrical. Since SR is the transition matrix of B, we have in B 
a path q1 3 q---dn—1 3 qn. This shows that wb € L(B) and thus w € L(B). 


M and N be the transition matrices of A and B. Let R be the Q x Q-matrix defined 
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for p,q € Q by 
Rpq = {a € A| there is an edge (p,a, q) in E} 


(note that R is well-defined because of the definition of an input merge). Let S be the 
Q x Q-matrix defined by 
1 ifp=@ 
Spg = 


0 otherwise 


Then M = RS and MN = SR. Thus A and B are elementary equivalent. 


chaptlexé#bss .2.7 
Witter bet 4 = (Q, E). We define for 1 < i < £, A; = (Qi, Ei) where Q; is the set 
of pairs (u,q) with u € A’ and q € Q such that there is a path labeled u leading to 


qin A. There is an edge labeled a from (u, p) to (v,q) in A; if u = bu’, v = v'a for 
a,b € A and (p,a,q) € E. For j =1...,r, let Ar; = (Qe+;, Ee+j) where Qe; is the 
set of triples (u, q, v) € Af x Q x AÏ such that there is a path labeled u leading to q and 
a path labeled v leaving q in A. There is an edge labeled a from (u,q,v) to (w,r,t) if 
and only if u = bu’, w = u'a, v = av’, t = v'c and (p,a, q) € E. 

For 0 < i < 4—1, the equivalence 6;,1 on Qi+ı defined by (u, q) = (u’,q’) if q = q' and 
u, u’ differ at most by the first letter is an input merge. Similarly, for £ < i < ¢+r-—1, 
the equivalence 0; defined by (u,q, v) = (u',q', v") if u = u’, q = q' and v, v differ at 
most by their last letter is an output merge. This shows that conditions (ii) and (iii) 
are satisfied. 

Finally since A is (4, r)-local, in a state (u, q, v) of Ag4r, the state q is determined by 
(u,v). Thus condition (iv) is also satisfied. 


Ve treat the case of an input merge. The case of an output merge is 
symmetrical. Let By = (Q2, F2) be a complete local automaton containing Ag = 
(P2, E2) as a subautomaton. Let F = Fs \ Eo, Q1 = P U F and let Bı = (Q1, Fi) be 
the Nasu embedding of A; with respect to Ag, Bo, bisae 

Let R, S be the matrices defined as in Exercise Bot We tave a € Rggq if (p,a,q) E E 
and Spg = 1 if p = q. The additional R-edges are the triples (f,a,q) such that 
f =(p,a,q) is in F. The additional S-edges are the triples (p,1, f) such that f begins 
with p. 

Let m be the map defined on the set P of paths of By of length at least 2 as follows. 
For (e€9, €1,---,@€n) with e; = (qi, Qi, qi+1), we define 7(€9, €1,.-.,€n) = (90,91,-+-)9n—1) 
with gi = (pi, ai, pi+1) defined by p; = q; if e; € E2 and p; = e;41 otherwise. 

We claim that m is a surjective map from the set P onto the set of nonempty paths 
of B. We have to verify that each g; is an edge of B1. We distinguish four cases. 

(i) If pi, pi4a € Pı. Then g; is an edge of A; and thus of B1. it is a RS-edge. 
(ii) If p; € Py and p41 € F. Then p; = G and pj41 = e442 . Thus g; is an RS-edge. 
(iii) If pi, pi+ı are in F, then p; = ej; and pj41 = ei. Thus gi = (pi, Qi, Pi+1) is a 
RS-edge. 

(iv) If p; € F and pj, € Pı then p; = ej41. Thus g; is a RS-edge. 

Conversely it is clear that for any nonempty path c in Bə there exist c’ € P such that 
ne) =e 

Let n be the order of By. Let c: p > q and d : p! S q' be two paths in Bı with 


|w| =n. Then there exist paths d: r 5 s 5 t and d' : r! S s! “ t such that m(d) = c 
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and 7(d’) = d. Since B2 has delay n, w is a constant for By. Thus we have also paths 
e:r 53 St and e:r’ S s&t. It is easy to verify that z(e) is a path from p to q’ 
and z(e’) a path from p’ to q. Thus w is a constant for By. 


4bis.2.7 
i et A be a (A,r)-local automaton of delay n. By Exercise pa there is a 
sequence Ag, A1, ..., Ag4r such that A = Ao, Agi, is a subset of the free (£, r)-local 


automaton and each A; is a merge of A;i+1. Since Appr is in ludęd in a complete (£ r)- 
local automaton, we may build using repeatedly mercei g Sce Bear, ... , B1, Bo 
of complete local automata with delay n such that A; is contained in B; for i = £ + 
r,...,0. Thus B = Bo is a complete local automaton with delay n containing A. 


chapter5 
Chapter VHT 
. section5.0 
Section i 


such that vrw € X*. Since u € U, there exists y € X* such that uvrwy € X*. Since 
X* is right dense, there is s € A* such that wys € X*. Finally, since uv € U there is 
z E€ X* such that uvrwysz € X*. Finally, since X* is right unitary, we have sz € X*. 
Thus vawysz € X* with wysz € X* and this shows that v € R, completing the proof 
of (a). 7 

(b) The fact that Y is synchronous results from Proposition trie 

(c) Let z € Z™* and let x € X*. Since Y’ is synchronous, there exists y € X* such 
that zxy € X*. Thus z € Z*. 

(d) Let first r € R. We may restrict to m = y(x) with x € X*. Then, there is y € X* 
such that rey € X*. Thus 1-ray = 1 and thus, by maximality of Ker(y(x)), 1- rz = 1. 
Conversely, if r satisfies the condition, let x € X*. Let y € X* be such that y(zy) is of 
minimal rank. Then 1- ray =1 and thus r € R. 


. section5.1 
Section 


sl 
is follows from Theorem ep applied to the subset of the alphabet 
formed of letters a € A such that y,(a) is invertible. 


Witte 2A The conditions are sufficient. Indeed, let w,w’ € Y*. Multiplying by an 
element of ~ +(e), we can choose w,w’ out of F(Z). Then the Z-interpretations of 
w,w’ are X interpretations. Thus y(w) = y(w’) iff p(w) = y(w), showing that G and 
Ge are equivalent. 

Let y, H and Z be as above. Since Z is recognizable, it is thin by Proposition I.5.2. 
Let w ¢ F(Z). We can suppose that w(w) = e. Let U be a set of words such that Y(U) 
generates G and let Y = Uw. The set X = Z N F(Y*) is finite because X C F(Y?). 
All three conditions are satisfied and thus the conclusion follows from the above. 
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g section5.2 
Section 


chapteešo5.2.1 _ 
et w € y '(e)M F(X). Suppose that p: wtw = p' - wuw Æ Ú. Since 


w € F(X), we have w = uv = u'v' with v,v’ prefixes of X and p: u =q-u=1 and 

1-vtw =1-v’rw. Since one of v,v’ is suffix of the other, this forces v = v’ and thus 
/ 

P=P. 


: section5.3 
Section 


Witte a Tet d be the degree of X. If w € Hx, then w has d interpretations w = s;2;p; 
with s; € ATX, x; € X* and p; E XA~. Thus ppw) = ud (Xs7',s:) x (pi, p7 X) 
which shows that yp(w) has rank d and thus yp(w) € Jp. 

Conversely, if w € H(X), let u,v € A be such that uwv € X. Then the row of index 


(u, wv) of y is reduced to {(uw,v)} and is thus not maximal. Thus yp(w) ¢ Jp 
by Exercise [VIBA 


.3.2 
e states 1,1-a,...,1-a”"~! are the fixpoints of the idempotent in y(a*), 
which has thus rank n. If n > d(X)+1, the idempotent in y(a*) is not in the minimal 
ideal of ~(A*), which is therefore not nil-simple. 


.3.3 
Witte sy Tet B= {ae A|aA*nX* #0} and C= {a € A | A*a N X* ZO}. Then 
the submonoid bA* N A*C is geberated by a code Z such that X C Z*. Then each word 
in Z* has a power in X* and thus X = Y o Z with Y elementary biprefix. 


.3.4 
VAA The sequence of equivalences 9; defined by 


p = q mod 8; iff Vw € A’ p-w=q-w. 


with 0o being the equality is increasing. If 0; = 6:41, then 6; = 6;,, for all k > 1. 
There is an 7 such that 0; has one class. The smallest such integer 7 is the depth d 
of y(At). This forces the sequence 69,...,94 to be strictly increasing from ĝo to 0a, 
whence d < Card(Q) — 1. 


3.5 
Writ Eet weéewt(J). Then for any L, L’ € A, L-w = L’ w. Thus w has rank 
one. Conversely, if w has rank 1, then it is in wy +(J). Thus wy 1(J) =~ 1(1). Asa 


direct consequence of Exercise 3.4, the depth of y(AT) is at most Card(A). 


. ction5.4 
Section 


.4.1 
Write T Tet j = i(u *a a). There is a path labeled ua* from i to j - a* If this path 
does not pass by 1, the finiteness of X imposes j +k >i+1. 

.4.2 

X is a group code, we have k = 0 and by the previous exercise, for each 

letter b € A we have i -b > i+ 1 for all į except one. This forces X = At, 
-4.3 
With k = 1 and u = b, we obtain i(b*, a) > i for all i provided a'—!ba does 
not have a prefix in X, i.e. except when a'™tb € X. 
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4.4 
et 7 € G not a power of a. Let |d] = {1,2,...,d}, E = {i € [d] | im < i} 


and F = |d] — E. Let d— 1 = ku +v withu>0and0<v<k. Let N be the set 
formed of the (u — 2)k first elements of F ordered by increasing value of im — i. Let 
L +... + Iu—2 be a partition of N in consective intervals with respect to the value of 
im —1. Let us show by induction on r, 1 < r < u — 2 that for each i € Ip, in —i >r. It 
is true for r = 1. Suppose now that the element j of I, with minimal value of iz — 7 is 
such that jm —j < r—1. Then by construction, we have im — i = r—1 for each i € Ip—1. 
But then 7 coincides with a”! on the r + 1 elements of J,_; U j, which implies by 
definition of k that m = a’~!, a contradiction. Thus 


u—2 u—2 
S = > > 22 —i)> 2 = k(u — 1)(u — 2)/2. 


On the other hand 


S =X (i —in) < (2k + 1)(d— 1) 
i€E 


Comparing the two inequalities, we obtain k(u — 1) (u — 2)/2 < (2k + 1)(d — 1). Since 
d—1 < (u+ 1)k, this implies k(u—1)(w—2)/2 < (2k+1)(u+1)k or (u—1)(u—2)/2 < 
(2k+1)(u+1). Since (u—1)(u—2) > (u+1)(u—5) for u > 0, we obtain 2(2k+1) > u—5 
and finally d < 4k? + 8k +1. 


We, denote, > usual i = 1 - at. Thus a = (12...d). Pega! to 
14-3, one has a! € E(X). Thm the permutation 7 = w *a af is defined by 
i- wa’ = ir -aë for 1<i < d. Let o = ra*. Then io = i- attwat. There are exactly 
2k values of i such that a’~!wa* has a wie in X. And on a’~!wa? is a prefix 
of X and i is an,e cedapice of o. Thus ø has at least d — 2k excedances. This implies, 
by Exorcist HERS te c belongs to the subgroup generated by a. 

2. We show that if X* contains a word t of length at most k, then it contains all the 
conjugates of t. This is a contradiction since all the powers of t would have k < d 
interpretations. Let t = a,---ag with a; E A and £ < k. We show by descending 
induction on i that t; = a;---aga,---aj_1 E X*. Assume that t;_; E€ X*. We apply 
statement 1 with a = a;_2 and w = t;_;a*-*. Thus m = t;_,a*~* x, af is in the 


euperoUD generated by a = (12.. a, Since 1m = 1- a*~*, we have 7 = a*~*. Thus 
1-t;-29a4 = 1- at;_2a%! = 2-t;_1a%! = 1. This shows that t;-2 € X* and concludes 
the proof. 


i he depth of the syntactic semigroup of X* is at most equal to ke. 
Let Y be the a of the right ideal E(X). For any a € A and y E.Y, th Pgrmutation 
o = (ay *, a") has at least d — 2k — 1 excedances. By Exercise E Í I = i this implies 
that ø is the subgroup generated by a. Since ay *aa" = (axa y)(y*a a") and since G(C) 
is generated by the permutations a *a y for a € A and y € Y, we obtain that G(C) is 
cyclic and thus that X = Aĉ. 
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i section5.5 
Section 


vitet T can be verified that the conditions stated on 8 and y are equivalent to: 
1. for each i 4 1871, one has iG > i. 
2. y is an n-cycle such that iy < i + 2 for all 2. 
3. for all i 41871,16-'y~!, one has iy > i. 

and that in turn, these conditions are necessary and sufficient for the code to be finite. 


chapt 05.5.1 
Wittie 3 We use the following facts concerning the group PGL2(5). It is sharply 3- 
transitive on 6 points, of order 120 = 6x5x4. As an abstact group it is isomorphic with 


the symmetric group S5. Let a = (123456), G = b*,a, y = b*ab. Since all the elements 
of order 6 of PGL2(5) are internally conjugate, we may suppose that the identification 
of {1, 2,3, 4,5 6}, with the projective line Z/5Z U œ is the same as the bijection p used 
in Example. A, yI ha? = (0001423) realized by the homography ¢ +> 2/(¢ + 2). By 
Exercise [VIMI6.If Sand y are such that 8 = (i, --- ip) with i1 <... < ip and y = a7 
where 7 is a product of cycles of the form (k,k +1,...,k +m) with k8 >k+m. 

If 6 has no fixpoints, then 6 = a. The permutation y is conjugate of œ by an 
involution which is a product of two cycles. The only solution is y = (132546). This 
gives the finite maximal bifix code X, of Example 5.3. 

If G has one fixpoint, then it coincides with @ on four points, which is impossible. 

If 8 has two fixpoints, these cannot be consecutive since otherwise 8 would coincide 
with a on 3 points. These two points cannot either form an orbit of a3, since otherwise 
a? would 8 would commute with Q3, in contradiction with the fact that the stabilizer 
of two points is, in PGL2(5) its own centralizer. Thus, the possible sets of fixpoints 
are (2,4), (3,5), (4,6), and (2,6), corresponding to 


Bı = (1356), B2 = (1345), 83 = (1235), 84 = (1246). 


Each of them generates, together with a, the group PGL2(5). As for y, we have y = a7 
where 7 is a product of two transpositions. This gives the two solutions 

1. y = (132546) with 7 = (23)(45) compatible with 6 = 84. 

2. y2 = (124365) with T = (34)(56) compatible with 8 = G2 or B = Bs. 
Thus, in the case where 8 has two fixpoints, the code X is one of the five possible 


1. The code Xə corre ding to 8 = 6, and y = yı whose minimal automaton is 
described in Table Fae 


9 10 
6 5 
10 6 
Table 12.8: The transitions of the minimal automaton of X3 


2. The code X3 = Xı symmetric of Xı by the exchange of a,b with 8 = b2, Y = 7. 
3. The code X4 = Xə which is the reversal of Xə with 8 = 83 and y = 72. 
4. The code X4 with 8 = 64 and y = a. 
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1;1 0 0 
2/0 1 0 
3/0 0 1 
4/1 1 0 
5/0 1 1 
6;1 1 1 
7/1 0 1 
Table 12.9: The vector space (Z/2Z)? 


5. The code Xə with 3 = 84 and y = 71. 
Note that X; = X4. 


5.2 
Witte 3 We identify (Z/2Z)? wit 46933 4956, 7} using the rests of x’ modulo 
1+a+2?. This is represented in Table . In this way, the permutation a = (1234567) 
corresponds, via the identification to the matrix 


0 1 0 

0 0 1 

1 1 0 
which represents the multiplication by x in the basis 1,xz,xz?. The group G(X) is 
generated by a = (1234567) and the permutations: 


b xa a? = (1236)(45)(7), b *q ab = (146)(235)(7), b xa b = (1254376) 


correspond, via the identification, to the matrices: 


010 110 010 
bxga7=|0 0 1 |,bxaab=| 0 O 1ļ|,bxab=|0 1 1l, 
1 1 1 0 1 1 1 0 1 


5.2 
PES Tie images of a?,ab,b are of minimal rank and thus the group G(X) is 


generated by a = (1 2---11), 8 = b *a aĉ, y = b *a ba and 6 = b *a b. We compute from 
the transitions of the automaton 


B (1236547 10)(8 9)(11) 
y (1479638 10)(2 5)(11) = 8a? 67ta! 
6 = (1256349871110) = Gas 


Let us show that a and 8 generate the Mathieu group M11 (see the Notes for a refer- 
ence). Let h(x) be the polynomial with coefficients in F3 = Z/3Z 


h(x) = -1 +r? — r? + rf +r”. 


The polynomial h(x) divides x1! — 1 and the ideal H of Fz|z]/(x1! — 1) generated by 
h(x) is a subspace of dimension 6 of F}!. The group of permutations of {1,2,...,11} 
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which leave invariant the support of the vectors in H (i.e. the set of coordinates with 
nonzero coefficient) is the Mathieu group M11. A basis of the orthogonal of H is made 
of the rows of the matrix K below. 


10000 1 -1 -1 -1 1 0 
01000 0 1 2) = -1 1 
K=|0 0100 -1 1 -1 0 21-1 
PERERA] 
Lora a a 


The colums of K are the components in the basis {1, £, €?, £3, £4} of the powers of a 
root € of the polynomial h(x). Thus the group Mj, contains a and also 8 whose action 
on the columns of K corresponds to the matrix 


0 0 -1 -1 


0 0 -l 1 -l 

0 0 1-1 0 
One may verify that a, 8 generate G by showing that they they generate a 4-transitive 
group. 


chapter5bis 
Chapter i XI 
g section5bis.2 
Section B 


there is exactly one pair (l,r) € L x R such that i = +r mod n. Since0< +r < 


2n — 2, we have actually l +r = i or l+r =i+n. Thus afa" = at or afa” = ata”. This 


shows that 


Ge =1+a+... +a! moda”-—1 (12.9.6) 
and thus ařa®(a — 1) =0 mod (a” — 1) as announced. 


isner 


E3 het m,n > 1 be integers. We show that if n is not a Hajós number, then 
neither does mn. Let G = Z/mnZ and H = {0,m,...,(n —1)m}. Thus H is a 
subgroup of G and H ~ Z/nZ. Let H = K + L be a factorization of H where neither 
K or L is periodic. Let M = {0,1,...,m—1} and N = L+ M. Since M is a 
set of representatives of the cosets of H, G = H+ M is a factorization of G. Thus 
G = K +N, with N = L + M, is a factorization of G. We show that N is not 
periodic. Assume by contradiction that p is a period of N and consider i € M. We 
have p+i+LCp+N=N andp+i+l0Cp+i+H =j +H for some appropriate 
j € M. Thus p+i+Ll C NA(j+H)=j+L. Since L is not periodic, we have p+i = j. 
Thus we have proved that p+ M C M, a contradiction since M is not periodic. 
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iia fosHasRedei 


e proof is by induction on n. Let G = Z/nZ and let G = L+R bea 
factorization. Since n is Hajós number, L or R is periodic. We may suppose that R is 
periodic. Then, we can write R = H +S where H is a nontrivial subgroup of G and the 
sum is direct. We have a factori tion G/H = (L+H)/H+(S+4H)/H. Since G/H has 
the Hajós property by Exercise ert eas the Rédei property by induction hypothesis. 
Thus either ((L + H)/H) # G/H and thus (H) # G or ((S+ H)/H) # G/H and thus 


(R) #G. 


suppose that L C [0,d] for some d > 0. Let x,y € Z with x < y be such that 
RO [2,2 +d] = [y, y +d]. Then y -— z is a period of R. Thus R is periodic of period at 
most 2%. 


g section5bis.3 
Section 


bast orGene : 
XE Let m > 1 be such that x = b™ is not a proper factor of a word in X. Then, 
since b € X, the pair (x, x) is synchronizing. Suppose that £ € L, mB is apt OX # Í. 


Then afr € X* and thus £ is in the set L(x) defined in Proposition B-60. Conversely, if 
LE L(x), then £ = kn + l' with a b+ N X Æ Ø. Thus the set of residues modi lo n of 


L and L(x) are i same. ‘The same holds for R and R(x). Thus Theorem ollows 


from Proposition [2.6, 


2.3 Le X SA oY be a finite maximal code containing Y. Let L,R be as in 
Proposition B.J. We cannot have X N (a*b* U b*a*) = Y N (a*b* U b*a*) since other- 
wise Card(L) = Card(K) Card(T) = tCard(T) and Card(R) = Card(#) Card(S) = 
dCard(S). The pair (L, R) would thus be a dt-factorization of Z/nZ and thus not 
an m-factorization. Assume first that there is an x € R Let us show that for every 
x > 0, the multiplicity of z in L + (RU zx) is larger than m. Indeed, since (S,T) is a 
factorization of Z/nZ, there is a pair r,l € S x T such that z =1+r mod n. Thus 
the value x is represented mod n in t ways as the sum x +n and in dt ways as the 
sum l +r. Thus the multiplicity of x is dt +t >m. The proof that the same property 
holds for (LU x) + R is symmetrical. 


a U aba” is composed of a prefix and a suffix code. Consider next an integer n > 2. 
Since n has the Hajós property, either L or R is n-periodic. We may assume that L is 
n-periodic of period p. Then n = pq and L = L’ + {0,p,...,p(q—1)}. The pair (L’, R) 
is a factorization of Z/qZ. By the induction hypothesis, the code Z = a4U alba? is 
composed of prefix and suffix codes. Then X C a” U {e,a?,...,a?4-))}(Z — a®) has 
the same property. 
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chapter6 
Chapter Ri 
ction6.1 
Section il 


: Assertion (a) is clear since a series has an infinite radius of convergence iff 
it is a polynomial. To prove (b), we consider the minimal automaton A = (Q,i,T) 
recognizing X. Let P be the Q x Q matrix defined by Pp = aa m(a). Then 
(I — Pt); = fx(t). Thus px is the inverse of the dominant eigenvalue of P and it is 
a pole of fx(t) (actually, the minimum of the modulus of all poles of fx(t)). 


. Isection6.2 
Section 


JE 


XB-T 
X. Let P be the Q x Q matrix defined by Ppa = X, 


6.1.1 
We consider, as in Exercise Mithe minimal automaton A = (Q, i, T) recognizing 
(a). Then (I— PH = fx (t). 


p:a=q m 


The dominant eigenvalue of P is px. Let v be the corresponding row eigenvector 
normalized by ` qeq vq = 1. Let (u) be the sequence of row vectors defined by 


ut) = u™ P with u the characteristic vector of the initial state i. Then fn = 


er ul”), The sequence (un) converges in mean to v and thus the density of X is 
O(X) = Vier Ut: 


wm) xr®(L) = 5 mw) xr (u) = 5 5 n(v)n(a) 


uel uc LNAnt! va=u 
= 5 ru) =a (1). 
ueELNAnrtt 


(c) Let pin = 2 72%) 7 yp-!. Then v = lim pp, and thus 


v xv = (lm un) * (lim um) = ln ining fH V 


HEMO e pypotheses imply that each X, is a thin maximal prefix code. Thus, 
by Theorem we have 6(X;5) = 1/XA(Xp). For p,q € Q, denote Ly, the set recognized 
by the automaton (Q, p,q). For each q € Q, the sets Lpg and Up.q=raLrg differ at most 
by the empty words and thus 6(Lpq) = X p.a=r T(@)6(Lrq)- Since 6(Lpq) = A(X} ), this 
shows that the distribution given is indeed stationary. That the stationary distribution 
is unique is a well-known fact. Recall that if P denotes the Q x Q-matrix defined by 
Poa = ip-azq T(G), and y a stationary distribution, we have yP = y and thus yP” = y 
for all n > 1. Thus, by taking the limit, we have for all q € Q, X peo V(P)6(Lpq) = 1() 
and thus 7(q) = 6(X7)- 

(b) We have -eo (9) = X res 7) = 1 and thus condition (i) is satisfied. Next, 


Y dy(p)m(a) =Y Y orla) = So oJ) = dyla) 


p-a=q qEJ I-a=J qEJ 


showing that condition (ii) is also satisfied. 
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i section6.3 
Section 


i ) and S(X) 
are non empty and equal (see Exercises and (ii) are 
equivalent. 

If C (w) is maximal, then w € S(X). Indeed, suppose that rwv € X* for some 
x € X*. Since C,.(w) C C (xw), we have C,.(w) = C,(aw). Thus wu € X*. 

If C, (w) is not maximal, then w ¢ E(X). Suppose indeed that C, (w) C C(u) 
with v € C,(u) — C,(w). Let s € S(X) and suppose that for some t € A*, we have 
wust E€ X*. Since C,(w) C C(u), we have uvst € X*. Since v € C,(u) we have 
uv E€ X* and consequently st € X*. Let v’ € C, (w) be such that vst = v'x with 
x € X*. Then uv'x = uvst forces v = v' by unambiguity, a contradiction. Thus there 
is no t as above and w ¢ E(X). 


We first verify the following property (*): if vuz = v'u’ for v,v’ € C,(w), 
u,u' € U, and z € A* ‚then v =v’, u = u’, and z = 1. 

Indeed, first note that u € E(X). Thus, there exists t € A* such that uzt € X*. 
Then 

(wv) (uzt) = (wv')(u't). (12.9.7) 

Each one of the first three parenthesized words is in X*. Now the fourth word, namely 

ti also in X*, because u’ is simplifying. The set X being a code, the word in 
(PSF has a unique factorization in words in X, starting with wv, and also with wv’. 
It follows that y = v'y or v! = vy for some y € X*. This implies that v = v’. Going 
back to poe this gives uz = u’. Now U is prefix. Consequently z = 1 and u = u’. 
This proves property (x). 

It follows immediately from (*) that C,.(w)U is prefix, and also, taking z = 1, that the 
product C.(w)U is unambiguous. This proves (a). To prove (b), we show that C,(w)S 
is right dense. For this, consider a word t € A*. The word wt is right completable, 


| l 
x; KE A Yp 


1 


| 
l 
| 
’ | 
PN Ww sin Vv nm y ò 


Figure 12.16: Factorization of wv = w'y. 


since w € E. Thus, wtt’ € X* for some t € A*. Thus, tt’ is in w~!X*. Consequently 
tt! = vy for some v € C,(w), y € X*. Now observe that w € E, and consequently 
also yw € E. Thus, tt’w = vyw € C,(w)S. This show that C,(w)S is right dense. 
From C,(w)S = C,(w)U A* it follows then by Theorem at the prefix set C,.(w)U 
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is maximal prefix. 
(c) Since DINK = E(X)NK, we have 6(Dx) = 0(E(X) = a(U). Thus 7(C;(w))d(Dx) = 
m(C,(w)a(U) = 2(C,(w)U) = 1. 


a a(1) = L Te ige 
R3 We have 7°(1) = o) ex TA (uv) = Nay Xex zlta (x) = 1. Thus 


condition (i) is satisfied. Next, 


`> m7 (wa) XG) > ra” (uwav) 


acA (u,v)EC (wa) 
= xo! 5 mal) (uwv) + `> ra” (uwz) 
(u,v)EC(wa),vA1 (u,1)EC(w) ,wEexX 
1 
= a. >» mal) (uwv) + > ral) (uw) = r° (w). 
(u,v)EC(wa),vA1 (u,1)EC(w) 


A symmetric argument shows that Jaca 7° (aw) = 1°(w). The contextual probability 
corresponds to the case where m is a Bernoulli distribution on B*. 


Chapter7bis 
Chapter ki 
p section7bis.1 
Section i 


cha bås1 
e unique factorization of a word w € {1,2,...,n} is obtained as follows. 


tee i be the least letter of w and let w = uiv where all laien of u are at least equal to 
i+1. Then iv € X;. We factorise in the same way u and obtain the factorization of w. 


i e factorization of a word w = a1a2: -an corresponds to the convex hull of 
the graph of points (i, (a1 ---a;)). 


RIES Tot m = fil2-- -ln be the factorization of m in a nonincreasing product of 
Lyndon words. Arguing by contradiction, suppose that n > 1. If < 41, then 44 € L, a 
contradiction with the definition of £. Thus £ > £44, showing that w has a nonincreasing 
factorization in Lyndon words of length n + 1, a contraction with the fact that w € L. 
Thus n = 1 and m €E L. Since l < w and w < m, we have also l < m. 

If l < p, then lp € L and | is not the longest proper prefix of w which is in L. Thus 
pl. 


agter bá s4 
XII.41 We show by induction oni > 1 that Z; contains all z, such that m (zr) = (Zs, zt) 


and s < i < r. It is true for i = 1. Suppose that it is true for j < i — 1 and consider zy 
such that m(zr) = (Zs, zt) with s< i <r. Ifs <i-— 1, then z, € Zi—ı by the induction 
hypothesis, and thus z, € Z;. Otherwise, m(zr) = (z;-1, 24). Suppose first z, € A. Since 
r < t, we have z; € Z; and thus zp € Z;. Otherwise, let 7(z:) = (Zu, 2v). By the previous 
exercise, we have u < s and thus u < i. We can thus repeat the same discussion with 
z replacing zp. Iterating this argument, we can suppose that z, = z% 1z with k > 0, 
t—1l<tand x E€ A or T(z) = (Zu, 2y) with u <i—1. We have, as above, z € Z; and 
thus zp € Zi. 
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g section7bis.2 
Section 


cha bast 
e prove the statement by induction on n. Let A* = X*_,---Xf bea 


factorisation obtained by composition of bisections and X; = Y*Z* be a bisection of 
X;. Then, by induction hypothesis, X; is a (i—1,n—i—1)-limited code. We consider the 
factorization A* = Y/---Yj' with Yn = Xn-1,..., Vise = Xin, Yin = Y/Y; = Z and 
Yio = Xi-1;---, Yı = Xı. Then Y; is an (j — 1,7 — 7)-limited code for 1 < j < i— 1 
and for i +2 < j < n. Let us show that Yii1 = Y is (i,n — i — 1)-limited. Let 
UQ,+++,Un—1 be such that uj—1uj € Y* for 1 < j < n — 1. Since Y C X¥ and since X; 
is (i — 1,n — i — 1)-limited, we have u;_1,u; E€ X;. Since Y is (1,0)-limited, we have 
ui € Y*. Thus Y is (i,n —i—1)-limited. The proof that Y; = Z is (i — 1,n — i)-limited 
is similar. 


RE The submonoid M satifies C (1,0) and thus U is (1, 0)-limited. Consequently, 
there exists a bisection of the form (U, Z). Let u,v € U* be such that uv € X*. 
Let u = u1 :-- Un with u1, U2,...,Un suffixes of X. Since X is (2,0)-limited, we have 
successively uz: Unu E X*,...,unv E X*, and finally v € X*. Thus, considered as a 
code on U, X is (1,0)-limited, which implies the existence of a bisection (X,Y) of U*. 


RIRS An easy inspection shows that Y is (1,1)-limited. Suppose that (X,Y, Z) 
is a trisection of A*. Since ged € Y and since X*Y* is suffix-closed (by Proposition 
5.6), ed € X*Y*, which implies ed € X. Similarly, since dac € Y and since Y*Z* is 
prefix-closed, da € Z. But then eda € X? N Z?, which is impossible. 


KIEA The submonoid M generated by the suffixes of y clearly satisfies the condition 
C(1,0). Let X’ be the code generating M. Since X’ is (1,0)-limited, there exists a 
bisection of A* of the form (X’,Z). Since y is unbordered, we have y € X’. Thus 
X™ = X*y* with X = y*(X' — y). 


chapter8 
Chapter KH) 
. section8.1 
Section fi 


A word x € X* as in the statement may be called separating . (a) A 
separating code is complete and synchronous since for any w € A*, one has rwg € X*. 

(b) Let P be the set of right contexts of x and let S be the set of left contexts of x. 
Then A* = SX*P unambiguously. Suppose that A* = S’X*P’ unambiguously. Let us 
first verify that the product S’X*P is unambiguous. Suppose indeed that syp = s'y'p' 
for some s,s’ € S’, y,y’ € X* and p,p’ € P. Then sypz = s'y'p'x are two factorizations 
in S’X* which is unambiguous and thus s = s’, yp = y'p'. Since X*P is unambiguous, 
y=y' and p= p. 

Let now R be the set such that A* = S’X*P+ R. Then SX*P = S’X*P+ R and 
multiplying on the right both sides by (1 — A)S, we obtain 


S! = S — R(1 — AJS. (12.9.8) 
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One can show symmterically that the product $X*P’ is unambiguous and that the set 
T such that A* = SX*P' +T satisfies 


P' = P — P(1 — ADT. (12.9.9) 


Substituting the expressions of P’ and S’ given by Equations IEEE) and IEK) in 
S'X* P' = SX* P, we obtain 


JEP = (2nd -ADX P- Pris 
SX PeR- TRU =A 


Thus R+T + RAT = RT which forces R = T = 0, by considering the terms of lowest 
degree of both sides. Thus S’X*P = SX*P and SX*P' = SX*P which implies P = P’ 
and S = S. 


g T ais the conditions, for any word w € A* there is a path 1 > p = 
q Š 1. Then p is in U(x) and q is in V(x). The hypothesis on U (æ), V(x) implies 
that w = wv with p = 1 q, showing that xu, vr € X*. Thus X is separating. The 
converse is clear. 


be a separating box. Let P be the set of right contexts of words 
in L and let Q be the set of left contexts of the words in R. Then A* = PX*Q 
unambiguously. 

68 .Obis.1quatro 
A uppose that S,T satisfy the PyDOUIESA For w € A*, there is a unique 
pair (s,t) € S x T such that y,(swt)i, = 1, and thus such that there is a path 


lpg +, 1. Since ~(s)ip = 1, p is in the set £. Since y(t)gi = 1, q is in c. By 


condition (ii), we have w = uv with p > 1 q. We obtain su,vt € X*. Thus S,T is 
a separating box. 
The converse implication is similar. 


(resp. Q) be the set of left (resp. right) contexts of b. Then A* = QX*P 
and thas = = P(A-1)Q = X' -1 + PdQ. 


chaptlex88 .Obis.3 
RHI One has a” —1= P(a—1)Q if and only if PQ=1+a+...+a™1. 


(b) Let V be the code defined by V — 1 = Q(A—1)R. We have 


T-1 = P(A-1)R+wQ(A-1DR 
A= ey =e 
Since w is of maximal length in Z, T is unambiguous. 
(c) We first show that T is uniquely factorizing. Suppose that T — 1 = F(A —1)G. 


Let n = |w| and m be the maximal length of words in G. It is possible to show that, 
for all f € F, |f| +m+1>n implies f € wA*. 


Version 22 janvier 2008 J. Berstel, D. Perrin and C. Reutenauer 


Solutions of Exercises 521 


This is shown by descending induction on the length of f. If f is of maximal length, 
then fAg C wV for |g| = m and thus f € wA*. Consider next f € F,a € AandgeéG 
such that |fag| > n with |g| = m. We first rule out out the case |f| < n. If this were 
the case, we first suppose that fag E€ wV. Then, for b Æ a, we have fbg ¢ wV and 
thus fbg = fig: for some fı € F and gı € G. Since |g| is maximal, we have | fi] > Jf], 
whence fı E€ wA* by the induction hypothesis, a contradiction. Suppose next that 
fAgQwV = 9. Using the same argument as above, we conclude that fa and fb are 
prefixes of w for a 4 b, a contradiction. Thus |f| > n. If fag ¢ wV, then fag = figi 
for some fı € F and gı € G. Then |f;| > |f| implies fı E€ wA* by induction hypothesis 
and finally f € wA*. 

Let Fı be the set of f € F such that |fag| < |w| for all a € A and g € G and let 
F; = F — F,. Then, as we have seen, F} = wF, and F} AG N wFzG = {w}. We thus 
obtain P(A — 1)R = Fı (A — 1)G and Q(A — 1)R = Fə(A — 1)G. Since Z is separating, 
it is uniquely factorizing, and thus R = G. Thus T is uniquely factorizing. 

The three-factor expression of T — 1 does not correspond to a decomposition of T 
since (P + wQ) is not prefix-closed and R is not suffix-closed. Since P + wQ and R 
are irreducible polynomials, these are the only possible decompositions of T. Thus T 
is indecomposable. 

(d) Z is separating. Let indeed z = b. We have for any word w € A*, wb € X*. Since 
X* = RZ*, we have either wb € Z* or wb = aav with v € Z*. In the first case we have 
b, wb € Z* and in the second one baa, vb € Z*. Thus condition (i) is satisfied. Next, 
we have Card(P U wQ) = 5 and Card(R) = 2. Thus condition (ii) is satisfied. Finally, 
R is not suffix closed since a ¢ R and thus condition (iii) is also satisfied. 


(b) We show that the code Z defined by the expression satisfies Z* = X*NY*. We 
have 


Z-1 = (1+ A+4+6?)(A—1)(a(A—1)4+1)(1+4+4 Aa) 
= (X-1)(a@(A—-1)+1)(1+44 Aa) 
(X —1)(14+ aA + ba + aA?a) 


and thus Z C X*, since Z — 1 = (X —1)P with P C X*. In the same way 


Z=1 (1+A+0*)((A—Dat+1)(A-1)(1+a+4 Aa) 
= (1+a+b++b)(1—a+a°+ba)(Y — 1) 


(1+aAa +b +b? + ba? +b? Aa)(Y — 1) 


Z—-1=Q(Y — 1) with Q c Y*. Thus Z decomposes on X and Y and consequently 
ZCX*NY*. The other inclusion follows from the fact that these are the only possible 
decompositions of Z. 

(c) Z is synchronous since X and Y are. Let x,y € Z* be such that yA*x C Z*. Then 
yA* C Y* since Y is suffix and A*a C X* since X is prefix. Consider the word zay. 
We cannot have ya € Z* (since a ¢ X*) and neither ax € Z* (since a ¢ Y*). Thus Z 
is not separating. a osoSebBor 
(d) Consider the automaton recognizing Z* represented on Figure BET Gt can be 
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a,b 
Figure 12.17: An automaton recognizing Z*. 


computed either from the list of words forming Z or using the direct product of au- 
tomata recognizing X* and Y*). Let us verify that ({b}, {1,a°}) is a separating box. 


3 
Indeed, the set of states q such that 0 ae q is Z = {1,3,6}. It is a maximal r f the 
transition monoid of the automaton appearing as the first column of Table eeo 
other maximal rows are {2,4,5} and {4,7}. Each of these sets intersects in exactly one 
point the get. {dd ee = UUige Q/¢ = 1}. This shows that condition (i) of Exer- 
cise ka is satisied for the pair ({1,3,6},{1,5,7}). It can be checked that condition (ii) 


is also satisfied and thus the pair is a separating box. The corresponding factorization 
is Z—1 = (X —1)P. Another separating box is ({1,a*,a*b}, {1,aba}. Indeed, the 


aba 


fig-exoSepBox 


set of states q such that q —> 1 is {2,7}. But the set {1,2,7} is a maximal besari 


of the transition monoid of the automaton, appearing as the first row of Table 

The other maximal columns are {3,5,7} and {4,6}. Each of them intersects in exactly 
one point the set {1,3,4} which is the set of states q such that 1 & q for u = 1,af 
or atb. Thus the pair ({1,3, 4}, {1,2,7}) satisfies condition (i). Since condition (ii) is 
also satisfied, the pair is a separating box. If corresponds to the other factorization 
Z-1=Q(Y -1). 


Table 12.10: The maximal rows and columns 


.Obis.6 
TO a) We have (omitting the underscore) 


o = (1+w)(X —1+GiwDı + Giw? — Giw + w? Di +w — w? —wD+w)++1 
= (1+w)R+ (+w) (w -w +w- 1)+1=(1+w)R+ ut 


It is easy to verify that R is a prefix code, that w is not a prefix of R, and that o is 
the characteristic polynomial of a maximal prefix code. 

The polynomial r = (X—1+(G—1)w(D—1))(1+w)+1 satisfies 7 = R(1+w)+wź4 and 
thus 7 has nonnegative coefficients. We have also r—1 = (P+(G—1)wQ)(A-—1)(1+ w) 
where P is the set of prefixes of X and Q the set of prefixes of D. Thus 7 is the 
characteristic polynomial of a finite maximal code. 
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(b) We have yy(X)Nw2.A* = w? Dı +w? Dw and yy,(X)N A*w? = Giw + Gw’. Thus 
Ywl X) N w? A* N A*w? = {w4}, which shows that z? = wt is a pure square for yy(X). 

(c) It follows from the fact that Y = (1+ w)R+w* and Z = R(1+w)+w? that for each 
y € Y*, we have either y € Z* or y = wz with z, zw € Z*. Indeed, if y = y1yo--- Yn, we 
have for each i = 1,...,n, yi € R or yj € wR or y; = wt. We then glue each prefix w 
with the previous element of the factorization, except perhaps for the first one. Thus 
a word with d disjoint interpretations in Y* has also d disjoint interpretations in Z*. 

(d) Let S be the set of suffixes of X and T be the set of suffixes of G. We have 
X —1=(A-1)S and G-—1=(A-1)T. Thus 


Y-1 = (1+w)(X-14+(G—-1)w(D—-1)) = (14 w)\(A—-1)(8 + Tw(D — 1) 
= (1+w)(A—-1L 


with L = S$—Tw+TwD. Thus, equivalently A* = LY*(1 +w) is a factorization. Since 
S is a disjoint union of d(X) maximal prefix codes and Tw C S, the set L is a disjoint 
union of d(X) maximal prefix codes. Thus any word has d(X) disjoint interpretations 
in Y*. 

(e) Let G’ = Yw~? and D' = w™?Y. We have G’ = (14+ w)Gi + w? and D' = 
(1+ w)D, +w?. Thus G’—1= (1 +w)(G — 1) and D' — 1 = (1 +w)(D — 1). We have 
then the factorization 


T-1 


l 
T 
+ 
Lae 

| 
+ 

Q 

| 
> 
& 

Q 

| 
© 


1+w)(X —14+(G—1)w(D —- 1) + (G -— 1)w(D’ — 1)) 
1+w)(A-—1)(S+Tw(1 + w + w?)(D—1)) 
(l+w)(A-—1)M. 


= (+w? 
= (a4 


where M is a disjoint union of d(X) maximal prefix codes (observe that (1 + w + 
w?)(D — 1) = E — 1 where F is a maximal prefix code). This shows that d(T) = d(X). 
By (c) we obtain the conclusion d(Z) = d(X). 

(f) Suppose that Z C V* where V is a prefix code. Fix a letter a € A. Set d = d(X) 
and let e < d be such that aê € D. Since d > 2, we have w # a. Since Z contains af 
and a“w, we have w € V*. Since Gw D — w® is a subset of Z, we have Dı C V* and 
thus af € V*. We conclude, since d is prime that a € V. The case where V is a suffix 
code is symmetric. 

(g) The set X = A2+(A—1)a"~!ba"(A—1) with d = 2n +1 is a maximal bifix code. 
The word (a"b)? is a pure square for X. Thus we may apply the above construction 
for any prime number d > 2. 


. section8.6 
Section 


.6.2 
RET Since a,c € Y, we have ba € Y. But then all conjugates of acb have a prefix 


w?)( 
= (L+w?)( 
\ 


(z) = (1 — u(z)/(1 — kz) with p(z) = po P17. Then for each n > 1 
Pn/k” = 1 — ui/k — ... — Un/k” 


whence the result. 
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